Florentin Smarandache & Jean Dezert 


(Editors) 


Advances and Applications of DSmT 
for Information Fusion 


(Collected works) 


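[Cover figure: flowchart of the DSmT fusion process. Sources $s_1$ and $s_2$ provide the belief
assignments $m_1(.) : D^\Theta \to [0,1]$ and $m_2(.) : D^\Theta \to [0,1]$. The classic DSm rule
based on the free model $M^f(\Theta)$ combines them over all $(X_1 \cap \ldots \cap X_k) = A$.
Integrity constraints are then introduced into $D^\Theta$, giving the hybrid model $M(\Theta)$.
The hybrid DSm rule for the hybrid model $M(\Theta)$,
$\forall A \in D^\Theta$, $m_{M(\Theta)}(A) = \phi(A)\,[m_{M^f(\Theta)}(A) + S_2(A) + S_3(A)]$,
feeds the decision-making stage.]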





American Research Press 
Rehoboth 
2004 


This book can be ordered in a paper bound reprint from: 


Books on Demand 

ProQuest Information & Learning 

(University of Microfilm International) 

300 N. Zeeb Road 

P.O. Box 1346, Ann Arbor 

MI 48106-1346, U.S.A. 

Phone.: 1-800-521-0600 (Customer Service) 

http://wwwlib.umi.com/bod/

It can also be ordered through Amazon.com or downloaded from the web site:

http://www.gallup.unm.edu/~smarandache/DSmT-book1.pdf


Copyrights 2004 by The Editors, The Authors for their articles and American Research Press, Rehoboth, 
Box 141, NM 87322, USA. 
Download books from the Digital Library 


http://www.gallup.unm.edu/~smarandache/eBooks-otherformats.htm


This book has been peer reviewed and recommended for publication by: 
1 - Professor Krassimir Atanassov 

Centre of Biomedical Engineering (CLBME) 

Bulgarian Academy of Sciences 


Bl. 105 Acad. G. Bontchev Str., 1113 Sofia, Bulgaria 


2 - Professor Bassel Solaiman 
Ecole Nationale Supérieure des Télécommunications de Bretagne 
Image and Information Processing Department 


Technopôle Brest-Iroise - CS 83818 - 29238 Brest Cedex 3, France


3 - Professor Pierre Valin 
Physics Department & Centre de Recherches Mathématiques 
University of Montréal 


C.P. 6128, succ. Centre-ville, Montréal, H3C 3J7, Canada 


ISBN: 1-931233-82-9 


Standard Address Number 297-5092 


Printed in the United States of America 


Contents

Preamble
Prefaces

I Advances on DSmT
1 Presentation of DSmT
1.4 Comparison of different rules of combinations
1.4.1 First example
1.4.2 Second example
3.3 Conclusion
3.4 References
4.3.1 Definition of the free-DSm model $M^f(\Theta)$
4.3.2 Example of a free-DSm model
4.4.5 Example 4: Shafer's model
General DSm rule of combination
... is a probability measure
7.4 Some examples for the GP...
8.1 Introduction
8.5.1 A possible modal interpretation
8.6.1 Modal logic
8.7.1 Definitions
8.7.2 Properties
8.8 Conclusion
8.9 References
9 On conjunctive and disjunctive combination rules of evidence
9.1 Introduction
9.2 Preliminary
11.3.1 Independence-interdependence
11.3.2 T-norm description
11.4 Conclusion
11.5 References

II Applications of DSmT
Conclusion (chapter 12)
12.7 References
13 Estimation of Target Behavior Tendencies using DSmT
13.1 Introduction
13.2 Statement of the Problem
13.3.1 The fuzzification interface
13.3.2 The behavior model
14.3 The Attribute Contribution to GDA
15.2.1 Association Problem no. 1
15.2.2 Association Problem no. 2
15.3 Attempts for solutions
15.3.1 The simplest approach
15.3.3 Blackman's approach
18.4 Conclusions
18.5 References

Biographies of contributors


Preamble 


This book is devoted to an emerging branch of Information Fusion based on a new approach for
modeling the fusion problematic when the information provided by the sources is both uncertain and
(highly) conflicting. This approach, known in the literature as DSmT (standing for Dezert-Smarandache
Theory), proposes new useful rules of combination. We have gathered in this volume a presentation of
DSmT from its beginnings to the latest developments. Part 1 of this book presents the current state of
the art of theoretical investigations, while Part 2 presents several applications of this new theory. We
hope that this first book on DSmT will stir up some interest among researchers and engineers working in
data fusion and in artificial intelligence. Many simple but didactic examples are proposed throughout
the book. As a young emerging theory, DSmT is probably not exempt from improvements, and its
development will continue to evolve over the years. Through this book we simply want to propose a new
look at the Information Fusion problematic and open a new track to attack the combination of
information.

[...] Dr. Pavlina Konstantinova, Dr. Albena Tchamova, Dr. Hongyan Sun, Samuel Corgne, Dr. Frédéric
Dambreville, Dr. Milan Daniel, Prof. Denis de Brucq, Prof. Mohamad Farooq, Dr. Mohammad
Khoshnevisan, Patrick Maupin, Dr. Grégoire Mercier and Prof. Tzvetan Semerdjiev for their
contributions to this first volume and their interest in and support of these new concepts. We
encourage all researchers interested in Information Fusion and in DSmT to contribute papers to a second
volume, which will be published a few years later. This field of research is promising and currently
very dynamic. Any comments, criticisms, notes and articles are welcome.

[...] and Pierre Valin for accepting to serve as peer-reviewers for this book.

The Editors


Prefaces 


Advances in science and technology often result from paradigm shifts. In the early 1900s, Einstein
tried to reconcile the notion of absolute space and time of Cartesian dynamics with Maxwell's
electrodynamic equations, which introduced an absolute speed for light in vacuum. Addressing this
dilemma inevitably led him to put space and time on an equal footing, for any observer in an inertial
frame, and special relativity was born. When he then tried to include gravitation in the picture, space
and time became warped by mass (or energy) and general relativity emerged by connecting locally
inertial frames. In each case, a new theory arose from relaxing assumptions which were formerly thought
to be immutable. We all know now the ideal regions of applicability of Cartesian dynamics (slow-moving
objects) compared to those of special relativity (fast-moving objects) and general relativity (cosmology
and strong gravitational fields). However, general relativity can reduce to special relativity, which
itself can become Cartesian dynamics in everyday life. The price to pay in going from Cartesian dynamics
to the more general formulations of relativity is the increasing complexity of the calculations.


In his classic 1976 book, Shafer stated the paradigm shift which led him to formulate an alternative
to the existing Bayesian formalism for automated reasoning, thus leading to what is commonly known as
Dempster-Shafer (DS) evidential reasoning. The basic concept was that an expert's complete ignorance
about a statement need not translate into giving a probability of 1/2 to the statement and the other
1/2 to its complement, as was assumed in Bayesian reasoning. Furthermore, when there are several
possible mutually exclusive alternatives (singletons) and the expert can only state positively the
probabilities of a few of these, the remaining probabilities had to be distributed in some a priori
fashion amongst all the other alternatives in Bayesian reasoning. The complete set of all the N
alternatives (the frame of discernment) had to be known from the outset, as well as their natural
relative frequency of occurrence. By allowing as an alternative that the ignorance could be assigned to
the set of all remaining alternatives without any further dichotomy, a new theory was thus born that
reasoned over sets of alternatives: DS theory.


Clearly the problem became more complex, as one had to reason over $2^N$ alternatives, the set of all
subsets of the N singletons (under the union operator). When Dempster's orthogonal sum rule is used
for combining (fusing) information from experts who might disagree with each other, one obtains the
usual Dempster-Shafer (DS) theory. The degree of disagreement, or conflict, enters prominently in the
renormalization process of the orthogonal sum rule and also signals when DS theory should be used with
extreme caution: the conflict must not be too large. Indeed, several paradoxes arise for highly
conflicting experts (sources), and these have to be resolved in some way. Going back to relativity for a
moment, the twin paradox occurs when one tries to explain it with special relativity, when actually it
is a problem that has to be handled by general relativity. A paradigm shift was necessary there, and one
will be needed here to solve the paradoxes (referred to in this book as counter-examples) of DS theory:
the relaxation of an a priori completely known frame of discernment made of mutually exclusive
singletons. This is what Dezert-Smarandache (DSm) theory is basically all about.


In the first part of this book, DSm theory is motivated by expanding the frame of discernment to
allow for presumed singletons in DS (or Bayesian) theory to actually have a well-defined intersection,
which immediately states when this theory should be used: whenever it is impossible to estimate at the
outset the granularity required to solve the problem at hand, either by construction (fuzzy concepts
which cannot be refined further), or when the problem evolves in time to eventually reveal a finer
granularity than originally assumed. It would then be important to continue being able to reason, rather
than to go back and expand the frame of discernment and start the reasoning process over again.


However, clearly the problem again becomes more complex than DS theory, as one has to reason now
over more alternatives (following Dedekind's sequence of numbers as N increases), consisting of the set
of all subsets of the N original singletons (but under the union and the intersection operators). This
is still less than would be required for a refined DS theory (if possible), which would consist of 2 to
the power $2^N - 1$ alternatives. The classic DSm rule of combination ensures the desired commutativity
and associativity properties, which made DS theory viable when the original orthogonal sum rule is used.
This classic DSm rule is particularly simple and corresponds to the free DSm model. Because the classic
DSm rule does not involve a renormalization depending on the conflict, it will not exhibit the problems
of DS theory under highly conflicting conditions. However, since one of the applications of DSm theory
involves dealing with problems with dynamic constraints (elements can be known not to occur at all at
a certain time), a hybrid rule of combination is also proposed which deals with exclusivity constraints
as well (some singletons are known to be truly exclusive). One can think of many examples where such
available knowledge fluctuates with time. In this first part, the authors make a special effort to
present instructive examples, which highlight both the free DSm model and the hybrid DSm model with
exclusivity and/or non-existential constraints. The classic counter-examples to DS theory are presented,
together with their solution in DSm theory.


In the second part of the book, data/information fusion applications of DSm theory are presented,
including the Tweety Penguin triangle, estimation of target behavior tendencies, generalized data
association for multi-target tracking in clutter, Blackman's data association problem, neutrosophic
frameworks for situation analysis, and land cover change detection from imagery, amongst others. This
second part of the book is much more of an applied nature than the theoretical first part. This dual
nature of the book makes it interesting reading for all open-minded scientists/engineers. Finally, I
would like to thank the authors for having given me the opportunity to peer-review this fascinating book.


Pierre Valin, Prof., Ph.D. 
Dept. de Physique 
Université de Montréal 
Montréal, Québec, Canada 


May, 2004 


This book presents the foundations, advances and some applications of a new theory of paradoxical
and plausible reasoning developed by Jean Dezert and Florentin Smarandache, known as DSmT.
This theory proposes a general method for combining uncertain, highly conflicting and imprecise data
provided by independent sources of information. It can be considered as a generalization of the
classical Dempster-Shafer mathematical theory of evidence, overcoming its inherent constraints, which
are closely related to the acceptance of the law of the excluded middle. Refuting that principle, DSmT
proposes a formalism to describe, analyze and combine all the available information, allowing the
possibility for paradoxes between the elements of the frame of discernment. It is adapted to deal with
each model of fusion occurring, taking into account all possible integrity constraints of the problem
under consideration, due to the true nature and granularity of the concepts involved. This theory shows
through the considered applications that conclusions drawn from it provide coherent results, which
agree with human reasoning and improve performance with respect to Dempster-Shafer Theory.


Krassimir Atanassov, Prof., Ph.D. 
Centre of Biomedical Engineering 
Bulgarian Academy of Sciences 
Sofia, Bulgaria 

May, 2004 


Scientific advancement has always come through the accumulation of achievements, ideas and experiences.
New ideas and approaches sometimes suffer from misunderstanding and sometimes from a kind of
“rejection” because they disturb existing approaches and humans do not easily accept change. Simply,
this is the history of human beings.


The information processing domain is no exception. While preparing this preface, I remembered what
happened when fuzzy sets theory was developed. In the 1970's, some said “Fuzzy logic is the opium
of sciences”! It is amazing to see how things have changed since that time and how fuzzy sets theory is
now well accepted and so well applied.


The scientific area of Information Fusion is beautifully “disturbing” our ways of thinking. In fact,
this area raises important questions: What is information? What is really informative in information?
How should information be fused? From my own point of view, this area is pushing the scientific
community towards promising approaches. One of these approaches is raised by Florentin Smarandache
& Jean Dezert in their book: Advances and Applications of DSmT for Information Fusion. This approach
aims to formalize fusion in the very particular context of uncertain and highly conflicting
information. The Dezert-Smarandache Theory (DSmT) should be considered as an extension of the
Dempster-Shafer (DS) as well as the Bayesian theories. From a technical point of view, the fundamental
question concerning the granularity of the singletons forming the frame of discernment is clearly raised.
The book is not limited to theoretical developments but also presents a set of very interesting
applications, thus making its reading a real pleasure.

I would like to thank the authors for their original contribution and to encourage the development of
this very promising approach.


Bassel Solaiman, Prof., Ph.D. 
ENST Bretagne 
Brest - France 


May, 2004 


Part I 


Advances on DSmT 


Chapter 1 


Presentation of DSmT 


Jean Dezert
ONERA
29 Av. de la Division Leclerc
92320 Châtillon
France

Florentin Smarandache
Department of Mathematics
University of New Mexico
Gallup, NM 8730
U.S.A.


Abstract: This chapter presents a general overview and the foundations of the DSmT,
i.e. the recent theory of plausible and paradoxical reasoning developed by the
authors, especially for the static or dynamic fusion of information arising from several
independent but potentially highly conflicting, uncertain and imprecise sources of
evidence. We introduce and justify here the basis of the DSmT framework with
respect to the Dempster-Shafer Theory (DST), a mathematical theory of evidence
developed in 1976 by Glenn Shafer. We present the DSm combination rules and
provide some simple illustrative examples and comparisons with the other main rules of
combination available in the literature for simple fusion problems. Detailed presentations
of recent advances and applications of DSmT are given in the next chapters of this book.


1.1 Introduction 


The Dezert-Smarandache Theory (DSmT) of plausible and paradoxical reasoning proposed by the
authors in recent years can be considered as an extension of the classical Dempster-Shafer
theory (DST) [33], but it includes fundamental differences with the DST. DSmT allows one to formally
combine any type of independent sources of information represented in terms of belief functions, but it
is mainly focused on the fusion of uncertain, highly conflicting and imprecise sources of evidence.
DSmT is able to solve complex static or dynamic fusion problems beyond the limits of the DST framework,
especially when conflicts between sources become large and when the refinement of the frame of the
problem under consideration, denoted $\Theta$, becomes inaccessible because of the vague, relative and
imprecise nature of the elements of $\Theta$.


The foundation of DSmT is based on the definition of Dedekind's lattice $D^\Theta$, also called the
hyper-power set of the frame $\Theta$ in the sequel. In the DSmT framework, $\Theta$ is first considered
as only a set $\{\theta_1, \ldots, \theta_n\}$ of $n$ exhaustive elements (closed world assumption)
without introducing any other constraint (exclusivity or non-existential constraints). This corresponds
to the free DSm model, on which the classic DSm rule of combination is based. The exhaustivity (closed
world) assumption is not actually fundamental, because one can always theoretically close any open
world, say $\Theta_{open}$, by including into it an extra element/hypothesis $\theta_0$ (although not
precisely identified) corresponding to all missing hypotheses of $\Theta_{open}$, and then work with the
new closed frame $\Theta = \{\theta_0\} \cup \Theta_{open} = \{\theta_0, \theta_1, \ldots, \theta_n\}$.
This idea has already been proposed and defended by Yager, Dubois & Prade and Testemale, and it differs
from the Transferable Belief Model (TBM) of Smets [42]. The proper use of the free DSm model for the
fusion depends on the intrinsic nature of the elements/concepts $\theta_i$ involved in the problem under
consideration. It becomes naturally justified when dealing with vague/continuous elements which cannot
be precisely defined and separated (e.g. the relative concepts of smallness/tallness, pleasure/pain,
hot/cold, colors (because of the continuous spectrum of light), etc.), so that no refinement of $\Theta$
into a new larger set $\Theta_{ref}$ of exclusive refined hypotheses is possible. In such a case, we just
call $\Theta$ the frame of the problem.
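To make the size of the hyper-power set concrete, the following Python sketch enumerates $D^\Theta$ for small $n$ under the free DSm model. It assumes that $D^\Theta$ is the closure of $\emptyset, \theta_1, \ldots, \theta_n$ under union and intersection. The encoding of each element as a set of Venn-diagram regions and the function name `hyper_power_set` are choices of this illustration, not notation from the book.

```python
from itertools import combinations


def hyper_power_set(n):
    """Enumerate D^Theta for n elements under the free DSm model.

    Assumption of this sketch: each element of D^Theta is encoded as the set
    of Venn-diagram regions it covers. A region is a non-empty set of indices
    and lies inside theta_i exactly for the indices i it contains.
    """
    indices = range(1, n + 1)
    regions = [
        frozenset(combo)
        for size in range(1, n + 1)
        for combo in combinations(indices, size)
    ]
    # theta_i covers every region whose index set contains i.
    generators = [frozenset(r for r in regions if i in r) for i in indices]

    elements = {frozenset()} | set(generators)  # the empty set belongs to D^Theta
    frontier = set(generators)
    while frontier:
        fresh = set()
        for a in frontier:
            for b in elements:
                for candidate in (a | b, a & b):
                    if candidate not in elements:
                        fresh.add(candidate)
        elements |= fresh
        frontier = fresh  # stop once union and intersection add nothing new
    return elements


sizes = [len(hyper_power_set(n)) for n in range(1, 5)]
print(sizes)  # [2, 5, 19, 167]
```

The counts follow Dedekind's sequence and grow much faster than the $2^n$ subsets of the classical power set, which is why the closure is only practical to enumerate for very small frames.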


When a complete refinement (or sometimes only a partial refinement) of $\Theta$ is possible and
thus allows us to work on $\Theta_{ref}$, then we call $\Theta_{ref}$ the frame of discernment (resp.
frame of partial discernment) of the problem, because some elements of $\Theta_{ref}$ are truly
exclusive and thus become (resp. partially) discernable. The refined frame of discernment assuming
exclusivity of all elements $\theta_i \in \Theta$ corresponds to Shafer's model, on which the DST is
based, and can be obtained from the free DSm model by introducing into it all exclusivity constraints.
All fusion problems dealing with truly exclusive concepts must obviously be based on such a model, since
it adequately describes the real and intrinsic nature of the hypotheses. Actually, any constrained model
(including Shafer's model) corresponds to what we call a hybrid DSm model. DSmT provides a generalized
hybrid DSm rule of combination for working with any kind of hybrid model, including exclusivity and
non-existential constraints as well, and it is not limited to the most constrained one, i.e. Shafer's
model (see the chapter devoted to the hybrid DSm rule for a detailed presentation and examples). Before
going further into this DSmT presentation, it is necessary to briefly present the foundations of the
DST [33] in order to point out the important differences between these two theories for managing the
combination of evidence.


1.2 Short introduction to the DST


In this section, we present a short introduction to the Dempster-Shafer theory. A complete presentation
of the Mathematical Theory of Evidence proposed by Glenn Shafer can be found in his milestone book [33].
Advances on DST can be found in [34] [48] and [49].


1.2.1 Shafer’s model and belief functions 


Let $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$ be the frame of discernment of the fusion
problem under consideration, having $n$ exhaustive and exclusive elementary hypotheses $\theta_i$. This
corresponds to Shafer's model of the problem. Such a model assumes that an ultimate refinement of the
problem is possible (exists and is achievable), so that the $\theta_i$ are precisely defined/identified
in such a way that we are sure that they are exclusive and exhaustive (closed-world assumption).


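The set of all subsets of $\Theta$ is called the power set of $\Theta$ and is denoted $2^\Theta$. Its
cardinality is $2^{|\Theta|}$. Since $2^\Theta$ is closed under unions, intersections, and complements,
it defines a Boolean algebra.

For example, if $\Theta = \{\theta_1, \theta_2, \theta_3\}$ then
$2^\Theta = \{\emptyset, \theta_1, \theta_2, \theta_3, \theta_1 \cup \theta_2, \theta_1 \cup \theta_3, \theta_2 \cup \theta_3, \theta_1 \cup \theta_2 \cup \theta_3\}$.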
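The three-element example can be reproduced with a short Python sketch. The labels `theta1`, `theta2`, `theta3` and the helper name `power_set` are hypothetical, introduced only for this illustration.

```python
from itertools import chain, combinations


def power_set(frame):
    """Return every subset of `frame` as a frozenset, smallest subsets first."""
    elements = sorted(frame)
    by_size = (combinations(elements, size) for size in range(len(elements) + 1))
    return [frozenset(subset) for subset in chain.from_iterable(by_size)]


theta = {"theta1", "theta2", "theta3"}  # hypothetical labels for the hypotheses
subsets = power_set(theta)
print(len(subsets))  # cardinality 2 ** len(theta)
for subset in subsets:
    print(sorted(subset))
```

Each subset is stored as a `frozenset` so that it can later serve as a dictionary key when masses are attached to it.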


In Shafer's model, a basic belief assignment (bba) $m(.) : 2^\Theta \to [0,1]$ associated to a given
body of evidence $\mathcal{B}$ (also called corpus of evidence) is defined by [33]

$$m(\emptyset) = 0 \quad \text{and} \quad \sum_{A \in 2^\Theta} m(A) = 1 \tag{1.1}$$


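Glenn Shafer defines the belief (credibility) and plausibility functions of $A \subseteq \Theta$ as

$$Bel(A) = \sum_{B \in 2^\Theta,\; B \subseteq A} m(B) \tag{1.2}$$

$$Pl(A) = \sum_{B \in 2^\Theta,\; B \cap A \neq \emptyset} m(B) = 1 - Bel(\bar{A}) \tag{1.3}$$

where $\bar{A}$ denotes the complement of the proposition $A$ in $\Theta$.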


The belief functions $m(.)$, $Bel(.)$ and $Pl(.)$ are in one-to-one correspondence [33]. The set of
elements $A \in 2^\Theta$ having a positive basic belief assignment is called the core/kernel of the
source of evidence under consideration and is denoted $\mathcal{K}(m)$.
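Definitions (1.1) to (1.3) translate almost literally into code. The sketch below assumes a bba stored as a Python dictionary from `frozenset` to mass; the frame, the masses and the function names are hypothetical values chosen for illustration.

```python
def belief(m, a):
    """Bel(A): total mass of the focal elements B included in A, as in (1.2)."""
    return sum(mass for b, mass in m.items() if b <= a)


def plausibility(m, a):
    """Pl(A): total mass of the focal elements B that intersect A, as in (1.3)."""
    return sum(mass for b, mass in m.items() if b & a)


def core(m):
    """K(m): the focal elements, i.e. the subsets carrying positive mass."""
    return {a for a, mass in m.items() if mass > 0}


THETA = frozenset({"t1", "t2", "t3"})  # hypothetical frame of discernment
m = {  # hypothetical bba: masses sum to 1 and nothing is put on the empty set
    frozenset({"t1"}): 0.5,
    frozenset({"t1", "t2"}): 0.3,
    THETA: 0.2,
}

A = frozenset({"t2"})
bel_A = belief(m, A)
pl_A = plausibility(m, A)
pl_from_complement = 1.0 - belief(m, THETA - A)  # right-hand side of (1.3)
print(bel_A, pl_A, pl_from_complement)
```

For this bba the interval $[Bel(A), Pl(A)]$ for $A = \{t2\}$ is wide because most of the mass that is compatible with `t2` sits on larger sets.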


1.2.2 Dempster’s rule of combination 


Let $Bel_1(.)$ and $Bel_2(.)$ be two belief functions provided by two independent (and a priori equally
reliable) sources/bodies of evidence $\mathcal{B}_1$ and $\mathcal{B}_2$ over the same frame of
discernment $\Theta$, with corresponding bba $m_1(.)$ and $m_2(.)$. Then the combined global belief
function, denoted $Bel(.) = Bel_1(.) \oplus Bel_2(.)$, is obtained by combining the bba $m_1(.)$ and
$m_2(.)$ through the following Dempster rule of combination [33], $m(.) = [m_1 \oplus m_2](.)$, where

$$m(\emptyset) = 0, \qquad m(A) = \frac{\displaystyle\sum_{X,Y \in 2^\Theta,\; X \cap Y = A} m_1(X)\, m_2(Y)}{1 - \displaystyle\sum_{X,Y \in 2^\Theta,\; X \cap Y = \emptyset} m_1(X)\, m_2(Y)} \quad \forall (A \neq \emptyset) \in 2^\Theta \tag{1.4}$$

$m(.)$ is a proper basic belief assignment if and only if the denominator in equation (1.4) is non-zero.
The degree of conflict between the sources $\mathcal{B}_1$ and $\mathcal{B}_2$ is defined by

$$k_{12} \triangleq \sum_{X,Y \in 2^\Theta,\; X \cap Y = \emptyset} m_1(X)\, m_2(Y) \tag{1.5}$$

The effect of the normalizing factor $1 - k_{12}$ in (1.4) consists in eliminating the conflicting pieces
of information between the two sources to combine, consistently with the intersection operator. When
$k_{12} = 1$, the combined bba $m(.)$ does not exist and the bodies of evidence $\mathcal{B}_1$ and
$\mathcal{B}_2$ are said to be in full contradiction. Such a case arises when there exists
$A \subset \Theta$ such that $Bel_1(A) = 1$ and $Bel_2(\bar{A}) = 1$. The core of the bba $m(.)$ equals
the intersection of the cores of $m_1$ and $m_2$, i.e. $\mathcal{K}(m) = \mathcal{K}(m_1) \cap \mathcal{K}(m_2)$.
Up to the normalization factor $1 - k_{12}$, Dempster's rule is formally nothing but a random set
intersection under stochastic assumption and it corresponds to the conjunctive consensus [13].
Dempster's rule of combination can be directly extended for the combination of $N$ independent and
equally reliable sources of evidence, and its major interest comes essentially from its commutativity
and associativity properties [33]. A recent discussion on Dempster's and Bayesian rules of combination
can be found in [5].
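Equations (1.4) and (1.5) can be sketched in a few lines of Python. The dictionary representation of a bba, the function name `dempster` and the numerical masses are assumptions made for this illustration only.

```python
def dempster(m1, m2):
    """Combine two bbas with Dempster's rule (1.4); also return the conflict k12 (1.5)."""
    joint = {}
    k12 = 0.0
    for x, mass_x in m1.items():
        for y, mass_y in m2.items():
            common = x & y
            if common:
                joint[common] = joint.get(common, 0.0) + mass_x * mass_y
            else:
                k12 += mass_x * mass_y  # product falls on the empty set
    if abs(1.0 - k12) < 1e-12:
        raise ValueError("sources are in full contradiction (k12 = 1)")
    fused = {a: mass / (1.0 - k12) for a, mass in joint.items()}
    return fused, k12


THETA = frozenset("abc")  # hypothetical three-element frame
A, B = frozenset("a"), frozenset("b")
m1 = {A: 0.6, B: 0.1, THETA: 0.3}  # hypothetical masses for source 1
m2 = {A: 0.2, B: 0.7, THETA: 0.1}  # hypothetical masses for source 2

fused, k12 = dempster(m1, m2)
print(round(k12, 4))
for focal, mass in fused.items():
    print(sorted(focal), round(mass, 4))
```

Here the conflict is 0.44, so every conjunctive product is divided by 0.56. As the conflict approaches 1 this division amplifies whatever small mass survives, which is the numerical side of the caution expressed in the text.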


1.2.3 Alternatives to Dempster’s rule of combination 


The DST is attractive for the Information Fusion community because it gives a nice mathematical model
for the representation of uncertainty and it includes Bayesian theory as a special case [33] (p. 4).
Although very appealing, the DST presents some weaknesses and limitations [27], already reported by
Zadeh [50] [52] [53] and Dubois & Prade in the eighties and reinforced by Voorbraak, because of the
lack of complete theoretical justification of Dempster's rule of combination, but mainly because of our
low confidence in the result of Dempster's rule of combination when the conflict between sources
becomes important (i.e. $k_{12} \nearrow 1$). Indeed, there exists an infinite class of cases where
Dempster's rule of combination can assign certainty to a minority opinion (other infinite classes of
counter-examples are discussed in chapter 5) or where the “ignorance” interval disappears forever
whenever a single piece of evidence commits all its belief to a proposition and its negation [29].
Moreover, elements of sets with larger cardinality can gain a disproportionate share of belief [43].
These drawbacks have fed intensive debates and research works for the last twenty years:


• either to interpret (and justify as best as possible) the use of Dempster's rule by several approaches
and to circumvent numerical problems with it when conflict becomes high. These approaches are
mainly based on the extension of the domain of the probability functions from the propositional
logic domain to the modal propositional logic domain [31] [82] [28] or on the hint model [22] and
probabilistic argumentation systems [14] [15] [16] [17] [18] [19] [20]. Discussions on these
interpretations of DST can be found in [38] [40] [42], and also in a later chapter of this book which
analyzes and compares Bayesian reasoning, Dempster-Shafer's reasoning and DSm reasoning on a very
simple but interesting example drawn from [28].

• or to propose new alternative rules. DSmT fits in this category since it extends the foundations of
DST and also provides new combination rules, as will be shown in the next sections.


Several interesting and valuable alternative rules have thus been proposed in the literature to
circumvent the limitations of Dempster's rule of combination. The major common alternatives are listed
in this section, and most of the currently available combination rules have recently been unified in a
nice general framework by Lefèvre, Colot and Vanoorenberghe in [25]. Their important contribution,
although strongly criticized by Haenni but properly justified by Lefèvre et al. in [26], shows clearly
that an infinite number of possible rules of combination can be built from Shafer's model depending on
the choice for the transfer of the conflicting mass (i.e. $k_{12}$). A justification of Dempster's rule
of combination was proposed afterwards in the nineties by the axiomatic of Philippe Smets [57] [24] [41]
[42] based on his Transferable Belief Model (TBM), related to anterior works of Cheng and Kashyap
in [6], a non-probabilistic interpretation of Dempster-Shafer theory (see [3] [4] for discussion).

Here is the list of the most common rules of combination¹ for two independent sources of evidence
proposed in the literature in the DST framework as possible alternatives to Dempster's rule of
combination to overcome its limitations. Unless explicitly specified, the sources are assumed to be
equally reliable.


• The disjunctive rule of combination [11] [13] [39]: This commutative and associative rule, proposed
by Dubois & Prade in 1986 and denoted here by the index $\cup$, is examined in detail in a later chapter
of this book. $m_\cup(.)$ is defined $\forall A \in 2^\Theta$ by

$$m_\cup(\emptyset) = 0, \qquad m_\cup(A) = \sum_{X,Y \in 2^\Theta,\; X \cup Y = A} m_1(X)\, m_2(Y) \quad \forall (A \neq \emptyset) \in 2^\Theta \tag{1.6}$$

The core of the belief function given by $m_\cup$ equals the union of the cores of $Bel_1$ and $Bel_2$.
This rule reflects the disjunctive consensus and is usually preferred when one knows that one of the
sources $\mathcal{B}_1$ or $\mathcal{B}_2$ is mistaken but without knowing which one among
$\mathcal{B}_1$ and $\mathcal{B}_2$.

¹ The MinC rule of combination is not included here since it is covered in detail in chapter 10.


• Murphy's rule of combination [27]: This commutative (but not associative) trade-off rule,
denoted here with index $M$, drawn from [46] [13], is a special case of convex combination of the bba
$m_1$ and $m_2$ and consists actually in a simple arithmetic average of the belief functions associated
with $m_1$ and $m_2$. $Bel_M(.)$ is then given $\forall A \in 2^\Theta$ by:

$$Bel_M(A) = \frac{1}{2}\left[Bel_1(A) + Bel_2(A)\right] \tag{1.7}$$


• Smets' rule of combination [41] [42]: This commutative and associative rule corresponds actually
to the non-normalized version of Dempster's rule of combination. It allows positive mass on the
null/empty set $\emptyset$. This eliminates the division by $1 - k_{12}$ involved in Dempster's rule
(1.4). Smets' rule of combination of two independent (equally reliable) sources of evidence (denoted
here by index $S$) is given by:

$$m_S(\emptyset) \equiv k_{12} = \sum_{X,Y \in 2^\Theta,\; X \cap Y = \emptyset} m_1(X)\, m_2(Y), \qquad m_S(A) = \sum_{X,Y \in 2^\Theta,\; X \cap Y = A} m_1(X)\, m_2(Y) \quad \forall (A \neq \emptyset) \in 2^\Theta \tag{1.8}$$
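• Yager's rule of combination [45] [46] [17]: Yager admits that in case of conflict the result is not reliable, so that $k_{12}$ plays the role of an absolute discounting term added to the weight of ignorance. The commutative (but not associative) Yager rule, denoted here by index $Y$, is given² by:

$$
\begin{aligned}
m_Y(\emptyset) &= 0 \\
m_Y(A) &= \sum_{\substack{X,Y \in 2^\Theta \\ X \cap Y = A}} m_1(X)\, m_2(Y) \qquad \forall A \in 2^\Theta,\ A \neq \emptyset,\ A \neq \Theta \\
m_Y(\Theta) &= m_1(\Theta)\, m_2(\Theta) + \sum_{\substack{X,Y \in 2^\Theta \\ X \cap Y = \emptyset}} m_1(X)\, m_2(Y) \qquad \text{when } A = \Theta
\end{aligned}
\tag{1.9}
$$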


• Dubois & Prade's rule of combination [13]: We admit that the two sources are reliable when they are not in conflict, but that only one of them is right when a conflict occurs. Then if one source observes a value in a set $X$ while the other observes this value in a set $Y$, the truth lies in $X \cap Y$ as long as $X \cap Y \neq \emptyset$. If $X \cap Y = \emptyset$, then the truth lies in $X \cup Y$ [13]. According to this principle, the commutative (but not associative) Dubois & Prade hybrid rule of combination, denoted here by index $DP$, which is a reasonable trade-off between precision and reliability, is defined by:

$$
\begin{aligned}
m_{DP}(\emptyset) &= 0 \\
m_{DP}(A) &= \sum_{\substack{X,Y \in 2^\Theta \\ X \cap Y = A \\ X \cap Y \neq \emptyset}} m_1(X)\, m_2(Y) + \sum_{\substack{X,Y \in 2^\Theta \\ X \cup Y = A \\ X \cap Y = \emptyset}} m_1(X)\, m_2(Y) \qquad \forall A \in 2^\Theta,\ A \neq \emptyset
\end{aligned}
\tag{1.10}
$$

² $\Theta$ represents here the full ignorance $\theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$ on the frame of discernment, according to the notation used in [33].
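The rules listed above can be compared side by side in a few lines of code. The following is a minimal Python sketch, not taken from the original text: it assumes that a bba is stored as a dict mapping `frozenset` focal elements to masses, that no input mass sits on the empty set, and that the function names and numerical masses are purely illustrative. Murphy's rule is written on masses, which is equivalent to averaging the belief functions since $Bel$ is linear in $m$.

```python
from itertools import product

EMPTY = frozenset()

def _pool(m1, m2, op):
    """Sum the products m1(X)m2(Y) over all pairs, keyed by op(X, Y)."""
    out = {}
    for (x, mx), (y, my) in product(m1.items(), m2.items()):
        z = op(x, y)
        out[z] = out.get(z, 0.0) + mx * my
    return out

def disjunctive(m1, m2):      # rule (1.6)
    return _pool(m1, m2, lambda x, y: x | y)

def murphy(m1, m2):           # rule (1.7), expressed on the masses
    keys = set(m1) | set(m2)
    return {a: 0.5 * (m1.get(a, 0.0) + m2.get(a, 0.0)) for a in keys}

def smets(m1, m2):            # rule (1.8): the conflict k12 stays on the empty set
    return _pool(m1, m2, lambda x, y: x & y)

def dempster(m1, m2):         # normalized version of (1.8); requires k12 < 1
    m = smets(m1, m2)
    k12 = m.pop(EMPTY, 0.0)
    return {a: v / (1.0 - k12) for a, v in m.items()}

def yager(m1, m2, frame):     # rule (1.9): the conflict goes to the full ignorance
    m = smets(m1, m2)
    k12 = m.pop(EMPTY, 0.0)
    m[frame] = m.get(frame, 0.0) + k12
    return m

def dubois_prade(m1, m2):     # rule (1.10): each partial conflict goes to X | Y
    return _pool(m1, m2, lambda x, y: (x & y) or (x | y))

# Illustrative masses on a two-element frame (hypothetical values).
A, B = frozenset("a"), frozenset("b")
AB = A | B
m1 = {A: 0.6, B: 0.3, AB: 0.1}
m2 = {A: 0.2, B: 0.7, AB: 0.1}
k12 = smets(m1, m2)[EMPTY]    # total conflicting mass
```

With these masses the conflict is $k_{12} = 0.48$. On a two-element frame Yager's and Dubois & Prade's rules give the same result, because every conflicting pair has $\Theta$ as its union.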


1.2.3.1 The unified formulation for rules of combinations involving conjunctive consensus 


We present here the unified framework recently proposed by Lefèvre, Colot and Vanoorenberghe to embed all the existing (and potentially forthcoming) combination rules involving conjunctive consensus in the same general mechanism of construction. Their general formulation is based on two steps.
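• Step 1: Computation of the total conflicting mass based on the conjunctive consensus

$$k_{12} \triangleq \sum_{\substack{X,Y \in 2^\Theta \\ X \cap Y = \emptyset}} m_1(X)\, m_2(Y) \tag{1.11}$$

• Step 2: Reallocation (convex combination) of the conflicting mass on $(A \neq \emptyset) \subseteq \Theta$ with some given coefficients $w_m(A) \in [0,1]$ such that $\sum_{A \subseteq \Theta} w_m(A) = 1$, according to

$$
\begin{aligned}
m(\emptyset) &= w_m(\emptyset)\, k_{12} \\
m(A) &= \Big[\sum_{\substack{X,Y \in 2^\Theta \\ X \cap Y = A}} m_1(X)\, m_2(Y)\Big] + w_m(A)\, k_{12} \qquad \forall (A \neq \emptyset) \in 2^\Theta
\end{aligned}
\tag{1.12}
$$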


The particular choice of the set of coefficients $w_m(.)$ provides a particular rule of combination. This general formulation shows that there exists an infinite number of possible rules of combination. Some rules are then justified or criticized with respect to the other ones mainly on their ability, or not, to preserve the associativity and commutativity properties of the combination. It can be easily shown that this general procedure provides all existing rules involving conjunctive consensus developed in the literature based on Shafer's model. As examples:


• Dempster's rule of combination (1.4) can be obtained from (1.12) by choosing, $\forall A \neq \emptyset$,

$$w_m(\emptyset) = 0 \quad \text{and} \quad w_m(A) = \frac{1}{1 - k_{12}} \sum_{\substack{X,Y \in 2^\Theta \\ X \cap Y = A}} m_1(X)\, m_2(Y) \tag{1.13}$$

• Yager's rule of combination (1.9) is obtained by choosing

$$w_m(\Theta) = 1 \quad \text{and} \quad w_m(A \neq \Theta) = 0 \tag{1.14}$$

• Smets' rule of combination (1.8) is obtained by choosing

$$w_m(\emptyset) = 1 \quad \text{and} \quad w_m(A \neq \emptyset) = 0 \tag{1.15}$$

• Dubois and Prade's rule of combination (1.10) is obtained by choosing

$$\forall A \in \mathcal{P}, \quad w_m(A) = \frac{1}{k_{12}} \sum_{\substack{A_1, A_2 \,\mid\, A_1 \cup A_2 = A \\ A_1 \cap A_2 = \emptyset}} m^* \tag{1.16}$$

where $m^* \triangleq m_1(A_1)\, m_2(A_2)$ corresponds to the partial conflicting mass which is assigned to $A_1 \cup A_2$, so that $w_m(A)\, k_{12}$ is exactly the second sum of (1.10). $\mathcal{P}$ is the set of all elements of $2^\Theta$ on which the conflicting mass is distributed. $\mathcal{P}$ is defined by

$$\mathcal{P} \triangleq \{A \in 2^\Theta \mid \exists A_1 \in \mathcal{K}(m_1),\ \exists A_2 \in \mathcal{K}(m_2),\ A_1 \cup A_2 = A \text{ and } A_1 \cap A_2 = \emptyset\} \tag{1.17}$$

The computation of the weighting factors $w_m(A)$ of Dubois and Prade's rule of combination does not depend only on the propositions they are associated with, but also on the belief mass functions which have caused the partial conflicts. Thus the belief mass functions leading to the conflict allow one to compute that part of the conflicting mass which must be assigned to the subsets of $\mathcal{P}$ [25]. Yager's rule coincides with Dubois and Prade's rule of combination when $\mathcal{P} = \{\Theta\}$.

³ Taking into account the correction of the typo in formula (56) given in [13], page 257.
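The two-step mechanism of (1.11) and (1.12) can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not code from the original text: bba are dicts keyed by `frozenset`, the weights $w_m(.)$ are passed as a dict (missing keys count as zero), and all function names and masses are hypothetical.

```python
def consensus_and_conflict(m1, m2):
    """Step 1: conjunctive consensus and total conflicting mass k12, as in (1.11)."""
    cons, k12 = {}, 0.0
    for x, mx in m1.items():
        for y, my in m2.items():
            z = x & y
            if z:
                cons[z] = cons.get(z, 0.0) + mx * my
            else:
                k12 += mx * my
    return cons, k12

def reallocate(cons, k12, w):
    """Step 2: m(A) = consensus(A) + w(A) * k12, as in (1.12)."""
    out = {a: cons.get(a, 0.0) + w.get(a, 0.0) * k12 for a in set(cons) | set(w)}
    return {a: v for a, v in out.items() if v > 0.0}

def w_dempster(cons, k12):    # weights (1.13)
    return {a: v / (1.0 - k12) for a, v in cons.items()}

def w_yager(frame):           # weights (1.14)
    return {frame: 1.0}

def w_smets():                # weights (1.15)
    return {frozenset(): 1.0}

# Illustrative masses on a two-element frame (hypothetical values).
A, B = frozenset("a"), frozenset("b")
AB = A | B
m1 = {A: 0.6, B: 0.3, AB: 0.1}
m2 = {A: 0.2, B: 0.7, AB: 0.1}

cons, k12 = consensus_and_conflict(m1, m2)
m_dempster = reallocate(cons, k12, w_dempster(cons, k12))
m_yager = reallocate(cons, k12, w_yager(AB))
m_smets = reallocate(cons, k12, w_smets())
```

Only the weight dictionary changes from one rule to the next, which is the point of the unified formulation.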


1.2.4 The discounting of sources of evidence 


Most of the rules of combination proposed in the literature are based on the assumption that the sources of evidence are equally reliable. When the sources are known not to be equally reliable and the reliability of each source is perfectly known (or at least has been properly estimated when possible [42] [25]), then it is natural and reasonable to discount each unreliable source proportionally to its corresponding reliability factor, according to the method proposed by Shafer in [33], chapter 11. Two methods are usually used for discounting the sources:


• Classical discounting method [33] [13] [42] [25]: Assume that the reliability/confidence⁴ factor $\alpha \in [0,1]$ of a source is known. The discounting of the bba $m(.)$ provided by the unreliable source then yields a new (discounted) bba $m'(.)$ as follows:

$$
\begin{aligned}
m'(A) &= \alpha \cdot m(A), \qquad \forall A \in 2^\Theta,\ A \neq \Theta \\
m'(\Theta) &= (1 - \alpha) + \alpha \cdot m(\Theta)
\end{aligned}
\tag{1.18}
$$

$\alpha = 1$ means total confidence in the source, while $\alpha = 0$ means that the reliability of the source is completely called into question.

⁴ We prefer to use here the terminology confidence rather than reliability since the notion of reliability is closely related to the repetition of experiments with random outputs, which may not always be possible in the context of some information fusion applications (see example 1.6 given by Shafer on the life on Sirius in [33], p. 23).


• Discounting by convex combination of sources [13]: This method of discounting is based on the convex combination of sources by their relative reliabilities, assumed to be known. Let us consider two independent unreliable sources of evidence with reliability factors $\alpha_1$ and $\alpha_2$, with $\alpha_1, \alpha_2 \in [0,1]$; then the result of the combination of the discounted sources is given $\forall A \in 2^\Theta$ by

$$Bel(A) = \frac{\alpha_1}{\alpha_1 + \alpha_2} Bel_1(A) + \frac{\alpha_2}{\alpha_1 + \alpha_2} Bel_2(A) \tag{1.19}$$

When the sources are highly conflicting and have been sufficiently discounted, Shafer has shown in [33], p. 253, that the combination of a large number $n$ of equally reliable sources using Dempster's rule on equally discounted belief functions becomes similar to the convex combination of the $n$ sources with equal reliability factors $\alpha_i = 1/n$. A detailed presentation of discounting methods can be found in [3].
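Both discounting methods are short enough to be written out. The sketch below is illustrative Python rather than code from the original text; it assumes bba stored as dicts keyed by `frozenset`, and the function names, masses and reliability factors are hypothetical.

```python
def discount(m, alpha, frame):
    """Classical discounting (1.18) of the bba m with reliability factor alpha."""
    out = {a: alpha * v for a, v in m.items() if a != frame}
    out[frame] = (1.0 - alpha) + alpha * m.get(frame, 0.0)
    return out

def bel(m, a):
    """Belief of a: total mass of the non-empty focal elements included in a."""
    return sum(v for b, v in m.items() if b and b <= a)

def convex_bel(m1, m2, alpha1, alpha2, a):
    """Convex combination (1.19) of the two belief functions, evaluated at a."""
    total = alpha1 + alpha2
    return (alpha1 / total) * bel(m1, a) + (alpha2 / total) * bel(m2, a)

# Illustrative masses on a two-element frame (hypothetical values).
A, B = frozenset("a"), frozenset("b")
AB = A | B
m1 = {A: 0.6, B: 0.3, AB: 0.1}
m2 = {A: 0.2, B: 0.7, AB: 0.1}

m1_half = discount(m1, 0.5, AB)                  # half of the mass moves to ignorance
bel_a = convex_bel(m1, m2, 0.9, 0.3, A)          # weights 0.75 and 0.25
```

With $\alpha = 0.5$, the discounted bba keeps half of each committed mass and moves the remainder to $\Theta$, so it still sums to one.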


It is important to note that such discounting methods must not be used as an ad-hoc tool to adjust the result of the fusion (once obtained) when a counter-intuitive or bad result arises, but only beforehand, when one has prior information on the quality of the sources. In the sequel of the book we will assume that the sources under consideration are a priori equally reliable/trustable, unless specified explicitly. Although very important for practical issues, the case of the fusion of known unreliable sources of information is not considered in this book because it depends on the choice of the discounting method adopted by the system designer (which is also closely related to the application under consideration and to the types of sources to be combined). Fundamentally, the problem of combining unreliable sources of evidence is the same as working with new sets of basic belief assignments, and thus has little interest in the framework of this book.


1.3 Foundations of the DSmT 


1.3.1 Notion of free and hybrid DSm models 


The development of the DSmT arises from the necessity to overcome the inherent limitations of the DST, which are closely related to the acceptance of Shafer's model (the frame of discernment $\Theta$ defined as a finite set of exhaustive and exclusive hypotheses $\theta_i$, $i = 1,\ldots,n$), of the principle of the third excluded middle (i.e. the existence of the complement for any element/proposition belonging to the power set of $\Theta$), and of Dempster's rule of combination (involving normalization) as the framework for the combination of independent sources of evidence. We argue that these three fundamental conditions of the DST can be removed and that another mathematical approach for the combination of evidence is possible.

The basis of the DSmT is the refutation of the principle of the third excluded middle and of Shafer's model, since for a wide class of fusion problems the intrinsic nature of the hypotheses can only be vague and imprecise, in such a way that a precise refinement is just impossible to obtain in reality, so that the exclusive elements $\theta_i$ cannot be properly identified and precisely separated. Many problems involving fuzzy, continuous and relative concepts described in natural language and having no absolute interpretation, like tallness/smallness, pleasure/pain, cold/hot, Sorites paradoxes, etc., fall into this category. DSmT starts with the notion of free DSm model, denoted $\mathcal{M}^f(\Theta)$, and considers $\Theta$ only as a frame of exhaustive elements $\theta_i$, $i = 1,\ldots,n$, which can potentially overlap. This model is free because no other assumption is made on the hypotheses but the weak exhaustivity constraint, which can always be satisfied according to the closure principle explained in the introduction of this chapter. No other constraint is involved in the free DSm model. When the free DSm model holds, the classic commutative and associative DSm rule of combination (corresponding to the conjunctive consensus defined on the free Dedekind's lattice, see next subsection) is performed.


Depending on the intrinsic nature of the elements of the fusion problem under consideration, it can however happen that the free model does not fit reality because some subsets of $\Theta$ can contain elements known to be truly exclusive, but also truly non-existing at all at a given time (especially when working on dynamic fusion problems where the frame $\Theta$ varies with time with the revision of the knowledge available). These integrity constraints are then explicitly and formally introduced into the free DSm model $\mathcal{M}^f(\Theta)$ in order to adapt it to fit reality as closely as possible, and to construct a hybrid DSm model $\mathcal{M}(\Theta)$ on which the combination will be efficiently performed. Shafer's model, denoted $\mathcal{M}^0(\Theta)$, corresponds to a very specific hybrid DSm model including all possible exclusivity constraints. The DST has been developed for working only with $\mathcal{M}^0(\Theta)$, while the DSmT has been developed for working with any kind of hybrid model (including Shafer's model and the free DSm model), to manage as efficiently and precisely as possible imprecise, uncertain and potentially highly conflicting sources of evidence while keeping in mind the possible dynamicity of the information fusion problem. The foundations of the DSmT are therefore totally different from those of all existing approaches managing uncertainties, imprecisions and conflicts. DSmT provides a new way to attack the information fusion problem with a general framework in order to cover a wide variety of problems. A detailed presentation of hybrid DSm models and of the hybrid DSm rule of combination is given in chapter 4.


DSmT also refutes the idea that sources of evidence provide their beliefs with the same absolute interpretation of the elements of the same frame $\Theta$. The conflict between sources arises not only because of the possible unreliability of the sources, but also because of possibly different and relative interpretations of $\Theta$, e.g. what is considered as good by somebody can be considered as bad by somebody else. There is some unavoidable subjectivity in the belief assignments provided by the sources of evidence; otherwise it would mean that all bodies of evidence have the same objective and universal interpretation (or measure) of the phenomena under consideration, which unfortunately rarely occurs in reality, except when the bba are based on some objective probability transformations. But in this last case, probability theory can handle the information properly and efficiently, and the DST, as well as the DSmT, becomes useless. If we now leave aside the probabilistic argumentation for the construction of bba, we claim that in most cases the sources of evidence provide their beliefs about the elements of the frame of the fusion problem based only on their own limited knowledge and experience, without reference to the (inaccessible) absolute truth of the space of possibilities.


The DSmT includes the possibility to deal with evidence arising from different sources of information which do not have access to the same absolute interpretation of the elements of $\Theta$ under consideration. The DSmT, although not based on probabilistic argumentation, can be interpreted as an extension of Bayesian theory and Dempster-Shafer theory in the following sense. Let $\Theta = \{\theta_1, \theta_2\}$ be the simplest frame made of only two hypotheses, then

• the probability theory deals, under the assumptions of exclusivity and exhaustivity of hypotheses, with basic probability assignments (bpa) $m(.) \in [0,1]$ such that

$$m(\theta_1) + m(\theta_2) = 1$$

• the DST deals, under the assumptions of exclusivity and exhaustivity of hypotheses, with bba $m(.) \in [0,1]$ such that

$$m(\theta_1) + m(\theta_2) + m(\theta_1 \cup \theta_2) = 1$$

• the DSmT deals, under only the assumption of exhaustivity of hypotheses (i.e. the free DSm model), with the generalized bba $m(.) \in [0,1]$ such that

$$m(\theta_1) + m(\theta_2) + m(\theta_1 \cup \theta_2) + m(\theta_1 \cap \theta_2) = 1$$


1.3.2 Notion of hyper-power set $D^\Theta$


One of the cornerstones of the DSmT is the notion of hyper-power set (see the following chapters for examples and a detailed presentation). Let $\Theta = \{\theta_1,\ldots,\theta_n\}$ be a finite set (called frame) of $n$ exhaustive elements⁵. The Dedekind's lattice, also called hyper-power set $D^\Theta$ in the DSmT framework, is defined as the set of all composite propositions built from elements of $\Theta$ with $\cup$ and $\cap$ operators⁶ such that:

1. $\emptyset, \theta_1, \ldots, \theta_n \in D^\Theta$.
2. If $A, B \in D^\Theta$, then $A \cap B \in D^\Theta$ and $A \cup B \in D^\Theta$.
3. No other elements belong to $D^\Theta$, except those obtained by using rules 1 or 2.

⁵ We do not assume here that the elements $\theta_i$ are necessarily exclusive. There is no restriction on the $\theta_i$ other than exhaustivity.
⁶ $\Theta$ generates $D^\Theta$ under operators $\cup$ and $\cap$.


The dual (obtained by switching $\cup$ and $\cap$ in expressions) of $D^\Theta$ is itself. There are elements in $D^\Theta$ which are self-dual (dual to themselves), for example $\alpha_8$ for the case $n = 3$ in the example below. The cardinality of $D^\Theta$ is bounded above by $2^{2^n}$ when the cardinality of $\Theta$ equals $n$, i.e. $|\Theta| = n$. The generation of the hyper-power set $D^\Theta$ is closely related to the famous Dedekind problem [8] [7] of enumerating the set of isotone Boolean functions. The generation of the hyper-power set is presented in chapter 2. Since for any given finite set $\Theta$, $|D^\Theta| \geq |2^\Theta|$, we call $D^\Theta$ the hyper-power set of $\Theta$.
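For small frames, rules 1 to 3 above can be applied by brute force. The following Python sketch is only an illustration under an assumption of its own: each proposition is encoded as the set of Venn-diagram regions it covers, a region being identified by the non-empty set of indices $i$ of the $\theta_i$ it lies in. The function name is hypothetical, and the closure loop is far too slow for large $n$.

```python
from itertools import combinations

def hyper_power_set(n):
    """Return D^Theta for a frame of n elements, by closure under union and intersection."""
    indices = range(1, n + 1)
    # A region of the Venn diagram is the non-empty set of theta_i it lies in.
    regions = [frozenset(c) for r in indices for c in combinations(indices, r)]
    # theta_i is encoded as the set of all regions lying inside it.
    thetas = [frozenset(reg for reg in regions if i in reg) for i in indices]
    elements = {frozenset()} | set(thetas)           # rule 1
    grew = True
    while grew:                                       # rule 2, until nothing new appears
        grew = False
        for a in list(elements):
            for b in list(elements):
                for c in (a | b, a & b):
                    if c not in elements:
                        elements.add(c)
                        grew = True
    return elements

sizes = [len(hyper_power_set(n)) for n in range(5)]   # [1, 2, 5, 19, 167]
```

The sizes obtained for $n = 0,\ldots,4$ match the first terms of the Dedekind sequence quoted in this section.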


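Example of the first hyper-power sets $D^\Theta$

• For the degenerate case ($n = 0$) where $\Theta = \{\}$, one has $D^\Theta = \{\alpha_0 \triangleq \emptyset\}$ and $|D^\Theta| = 1$.

• When $\Theta = \{\theta_1\}$, one has $D^\Theta = \{\alpha_0 \triangleq \emptyset, \alpha_1 \triangleq \theta_1\}$ and $|D^\Theta| = 2$.

• When $\Theta = \{\theta_1, \theta_2\}$, one has $D^\Theta = \{\alpha_0, \alpha_1, \ldots, \alpha_4\}$ and $|D^\Theta| = 5$ with $\alpha_0 \triangleq \emptyset$, $\alpha_1 \triangleq \theta_1 \cap \theta_2$, $\alpha_2 \triangleq \theta_1$, $\alpha_3 \triangleq \theta_2$ and $\alpha_4 \triangleq \theta_1 \cup \theta_2$.

• When $\Theta = \{\theta_1, \theta_2, \theta_3\}$, one has $D^\Theta = \{\alpha_0, \alpha_1, \ldots, \alpha_{18}\}$ and $|D^\Theta| = 19$ with

$$
\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 & \alpha_{10} \triangleq \theta_2 \\
\alpha_2 \triangleq \theta_1 \cap \theta_2 & \alpha_{11} \triangleq \theta_3 \\
\alpha_3 \triangleq \theta_1 \cap \theta_3 & \alpha_{12} \triangleq (\theta_1 \cap \theta_2) \cup \theta_3 \\
\alpha_4 \triangleq \theta_2 \cap \theta_3 & \alpha_{13} \triangleq (\theta_1 \cap \theta_3) \cup \theta_2 \\
\alpha_5 \triangleq (\theta_1 \cup \theta_2) \cap \theta_3 & \alpha_{14} \triangleq (\theta_2 \cap \theta_3) \cup \theta_1 \\
\alpha_6 \triangleq (\theta_1 \cup \theta_3) \cap \theta_2 & \alpha_{15} \triangleq \theta_1 \cup \theta_2 \\
\alpha_7 \triangleq (\theta_2 \cup \theta_3) \cap \theta_1 & \alpha_{16} \triangleq \theta_1 \cup \theta_3 \\
\alpha_8 \triangleq (\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3) & \alpha_{17} \triangleq \theta_2 \cup \theta_3 \\
\alpha_9 \triangleq \theta_1 & \alpha_{18} \triangleq \theta_1 \cup \theta_2 \cup \theta_3
\end{array}
$$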


Note that the complement $\bar{A}$ of any proposition $A$ (except for $\emptyset$ and for the total ignorance $I_t \triangleq \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$) is not involved within DSmT because of the refutation of the third excluded middle. In other words, $\forall A \in D^\Theta$ with $A \neq \emptyset$ and $A \neq I_t$, $\bar{A} \notin D^\Theta$. Thus $(D^\Theta, \cap, \cup)$ does not define a Boolean algebra. The cardinality of the hyper-power set $D^\Theta$ for $n \geq 0$ follows the sequence of Dedekind's numbers [35], i.e. 1, 2, 5, 19, 167, 7580, 7828353, ... (see next chapter for details).

Elements $\theta_i$, $i = 1,\ldots,n$ of $\Theta$ constitute the finite set of hypotheses/concepts characterizing the fusion problem under consideration. $D^\Theta$ constitutes what we call the free DSm model $\mathcal{M}^f(\Theta)$ and allows one to work with fuzzy concepts which depict a continuous and relative intrinsic nature. Such kinds of concepts cannot be precisely refined in an absolute interpretation because of the unapproachable universal truth.


However, for some particular fusion problems involving discrete concepts, the elements $\theta_i$ are truly exclusive. In such case, all the exclusivity constraints on $\theta_i$, $i = 1,\ldots,n$ have to be included in the previous model to characterize properly the true nature of the fusion problem and to fit it with reality. By doing this, the hyper-power set $D^\Theta$ reduces naturally to the classical power set $2^\Theta$ and this constitutes the most restricted hybrid DSm model, denoted $\mathcal{M}^0(\Theta)$, coinciding with Shafer's model. As an example, let's consider the 2D problem where $\Theta = \{\theta_1, \theta_2\}$ with $D^\Theta = \{\emptyset, \theta_1 \cap \theta_2, \theta_1, \theta_2, \theta_1 \cup \theta_2\}$ and assume now that $\theta_1$ and $\theta_2$ are truly exclusive (i.e. Shafer's model $\mathcal{M}^0$ holds); then, because $\theta_1 \cap \theta_2 = \emptyset$ under $\mathcal{M}^0$, one gets

$$D^\Theta = \{\emptyset, \theta_1 \cap \theta_2 = \emptyset, \theta_1, \theta_2, \theta_1 \cup \theta_2\} = \{\emptyset, \theta_1, \theta_2, \theta_1 \cup \theta_2\} \equiv 2^\Theta.$$


Between the class of fusion problems corresponding to the free DSm model $\mathcal{M}^f(\Theta)$ and the class of fusion problems corresponding to Shafer's model $\mathcal{M}^0(\Theta)$, there exists another wide class of hybrid fusion problems involving in $\Theta$ both fuzzy continuous concepts and discrete hypotheses. In such a (hybrid) class, some exclusivity constraints and possibly some non-existential constraints (especially when working on dynamic⁷ fusion) have to be taken into account. Each hybrid fusion problem of this class will then be characterized by a proper hybrid DSm model $\mathcal{M}(\Theta)$ with $\mathcal{M}(\Theta) \neq \mathcal{M}^f(\Theta)$ and $\mathcal{M}(\Theta) \neq \mathcal{M}^0(\Theta)$; see the examples presented in chapter 4.


1.3.3 Generalized belief functions


From a general frame $\Theta$, we define a map $m(.) : D^\Theta \rightarrow [0,1]$ associated to a given body of evidence $\mathcal{B}$ as

$$m(\emptyset) = 0 \quad \text{and} \quad \sum_{A \in D^\Theta} m(A) = 1 \tag{1.20}$$

The quantity $m(A)$ is called the generalized basic belief assignment/mass (gbba) of $A$.

The generalized belief and plausibility functions are defined in almost the same manner as within the DST, i.e.

$$Bel(A) = \sum_{\substack{B \subseteq A \\ B \in D^\Theta}} m(B) \tag{1.21}$$

$$Pl(A) = \sum_{\substack{B \cap A \neq \emptyset \\ B \in D^\Theta}} m(B) \tag{1.22}$$

These definitions are compatible with the definitions of classical belief functions in the DST framework when $D^\Theta$ reduces to $2^\Theta$ for fusion problems where Shafer's model $\mathcal{M}^0(\Theta)$ holds. We still have $\forall A \in D^\Theta$, $Bel(A) \leq Pl(A)$. Note that when working with the free DSm model $\mathcal{M}^f(\Theta)$, one always has $Pl(A) = 1$ $\forall A \neq \emptyset \in D^\Theta$, which is normal since any two non-empty elements of $D^\Theta$ overlap in that model.

⁷ i.e. when the frame $\Theta$ is changing with time.
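The definitions (1.21) and (1.22) can be tried out on the smallest non-trivial frame. The Python sketch below is an illustration, not part of the original text; it assumes the free DSm model on $\Theta = \{\theta_1, \theta_2\}$, encodes each element of $D^\Theta$ as the set of Venn regions it covers (an encoding chosen for this sketch), and uses a hypothetical gbba.

```python
# Venn regions of the free model: "1" = theta1 only, "2" = theta2 only,
# "12" = the overlap of theta1 and theta2 (encoding assumed for this sketch).
T1 = frozenset({"1", "12"})
T2 = frozenset({"2", "12"})
T1_AND_T2 = T1 & T2
T1_OR_T2 = T1 | T2

def bel(m, a):
    """Generalized belief (1.21): mass of the non-empty elements included in a."""
    return sum(v for b, v in m.items() if b and b <= a)

def pl(m, a):
    """Generalized plausibility (1.22): mass of the elements intersecting a."""
    return sum(v for b, v in m.items() if b & a)

# Hypothetical gbba, chosen only for illustration.
m = {T1: 0.5, T2: 0.2, T1_AND_T2: 0.2, T1_OR_T2: 0.1}

bel_t1 = bel(m, T1)              # m(theta1) + m(theta1 ∩ theta2)
pl_t1 = pl(m, T1)                # every focal element meets theta1
```

Here `bel_t1` evaluates to 0.7, while the plausibility of every non-empty element is 1, as stated above for the free model.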


1.3.4 The classic DSm rule of combination 


When the free DSm model $\mathcal{M}^f(\Theta)$ holds for the fusion problem under consideration, the classic DSm rule of combination $m_{\mathcal{M}^f(\Theta)}(.) \equiv m(.) \triangleq [m_1 \oplus m_2](.)$ of two independent sources of evidence $\mathcal{B}_1$ and $\mathcal{B}_2$ over the same frame $\Theta$, with belief functions $Bel_1(.)$ and $Bel_2(.)$ associated with gbba $m_1(.)$ and $m_2(.)$, corresponds to the conjunctive consensus of the sources. It is given by [9] [10]:

$$\forall C \in D^\Theta, \qquad m_{\mathcal{M}^f(\Theta)}(C) \equiv m(C) = \sum_{\substack{A,B \in D^\Theta \\ A \cap B = C}} m_1(A)\, m_2(B) \tag{1.23}$$

Since $D^\Theta$ is closed under $\cup$ and $\cap$ set operators, this new rule of combination guarantees that $m(.)$ is a proper generalized belief assignment, i.e. $m(.) : D^\Theta \rightarrow [0,1]$. This rule of combination is commutative and associative and can always be used for the fusion of sources involving fuzzy concepts. This rule can be directly and easily extended to the combination of $k > 2$ independent sources of evidence (see the expression for $S_1(.)$ in the next section and chapter 4 for details).


This classic DSm rule of combination becomes very expensive in terms of computations and memory size due to the huge number of elements in $D^\Theta$ when the cardinality of $\Theta$ increases. This remark is however valid only if the cores (the sets of focal elements of the gbba) $\mathcal{K}_1(m_1)$ and $\mathcal{K}_2(m_2)$ coincide with $D^\Theta$, i.e. when $m_1(A) > 0$ and $m_2(A) > 0$ for all $A \neq \emptyset \in D^\Theta$. Fortunately, in most practical applications the sizes of $\mathcal{K}_1(m_1)$ and $\mathcal{K}_2(m_2)$ are much smaller than $|D^\Theta|$ because bodies of evidence generally allocate their basic belief assignments only over a subset of the hyper-power set. This makes the implementation of the classic DSm rule (1.23) easier.


The DSm rule is actually very easy to implement. It suffices to multiply the mass of each focal element of $\mathcal{K}_1(m_1)$ by the mass of each focal element of $\mathcal{K}_2(m_2)$ and then to pool all combinations which are equivalent under the algebra of sets, according to figure 1.1.


Figure 1.1 represents the DSm network architecture of the DSm rule of combination. The first layer of the network consists of all gbba of focal elements $A_i$, $i = 1,\ldots,n$ of $m_1(.)$. The second layer of the network consists of all gbba of focal elements $B_j$, $j = 1,\ldots,k$ of $m_2(.)$. Each node of layer 2 is connected with each node of layer 1. The output layer (on the right) consists of the combined basic belief assignments of all possible intersections $A_i \cap B_j$, $i = 1,\ldots,n$ and $j = 1,\ldots,k$. The last step of the classic DSm rule (not included in the figure) consists in the compression of the output layer by regrouping (summing up) all the combined belief assignments corresponding to the same focal elements (for example, if $X = A_2 \cap B_3 = A_4 \cap B_5$, then $m(X) = m(A_2 \cap B_3) + m(A_4 \cap B_5)$). If a third body of evidence provides a new gbba $m_3(.)$, then one combines it by connecting the output layer with the layer associated to $m_3(.)$, and so on. Because of the commutativity and associativity properties of the classic DSm rule, the DSm network can be designed with any order of the layers.


[Figure: two-layer network; each output node carries $m(A_i \cap B_j) = m_1(A_i)\, m_2(B_j)$ for $i = 1,\ldots,n$ and $j = 1,\ldots,k$.]

Figure 1.1: Representation of the classic DSm rule on $\mathcal{M}^f(\Theta)$
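The network of figure 1.1 translates almost literally into two nested loops followed by the compression step. The Python sketch below is illustrative and not taken from the original text: it assumes the free DSm model on $\Theta = \{\theta_1, \theta_2\}$, encodes elements of $D^\Theta$ as sets of Venn regions (an encoding chosen for this sketch), and uses hypothetical masses and function names.

```python
from functools import reduce

# Venn regions of the free model: "1" = theta1 only, "2" = theta2 only,
# "12" = the overlap of theta1 and theta2 (encoding assumed for this sketch).
T1 = frozenset({"1", "12"})
T2 = frozenset({"2", "12"})
T1_AND_T2 = T1 & T2
T1_OR_T2 = T1 | T2

def classic_dsm(m1, m2):
    """Classic DSm rule (1.23): pool the products m1(A)m2(B) by A ∩ B."""
    out = {}
    for a, ma in m1.items():                      # layer 1 of the network
        for b, mb in m2.items():                  # layer 2 of the network
            c = a & b                             # output node A ∩ B
            out[c] = out.get(c, 0.0) + ma * mb    # compression of equal nodes
    return out

def classic_dsm_many(sources):
    """Chain the rule over k sources; the order is irrelevant by associativity."""
    return reduce(classic_dsm, sources)

# Hypothetical gbba, chosen only for illustration.
m1 = {T1: 0.7, T2: 0.3}
m2 = {T1: 0.4, T1_OR_T2: 0.6}
m3 = {T2: 1.0}

fused12 = classic_dsm(m1, m2)
fused123 = classic_dsm_many([m1, m2, m3])
```

With these inputs `fused12` assigns 0.70 to $\theta_1$, 0.18 to $\theta_2$ and 0.12 to $\theta_1 \cap \theta_2$; adding the third source moves the mass of $\theta_1$ onto $\theta_1 \cap \theta_2$.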


1.3.5 The hybrid DSm rule of combination 


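When the free DSm model $\mathcal{M}^f(\Theta)$ does not hold due to the true nature of the fusion problem under consideration, which requires taking into account some known integrity constraints, one has to work with a proper hybrid DSm model $\mathcal{M}(\Theta) \neq \mathcal{M}^f(\Theta)$. In such case, the hybrid DSm rule of combination based on the chosen hybrid DSm model $\mathcal{M}(\Theta)$ for $k \geq 2$ independent sources of information is defined for all $A \in D^\Theta$ as (see chapter 4 for details):

$$m_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A) \Big[ S_1(A) + S_2(A) + S_3(A) \Big] \tag{1.24}$$

where $\phi(A)$ is the characteristic non-emptiness function of a set $A$, i.e. $\phi(A) = 1$ if $A \notin \boldsymbol{\emptyset}$ and $\phi(A) = 0$ otherwise, where $\boldsymbol{\emptyset} \triangleq \{\emptyset_{\mathcal{M}}, \emptyset\}$. $\emptyset_{\mathcal{M}}$ is the set of all elements of $D^\Theta$ which have been forced to be empty through the constraints of the model $\mathcal{M}$ and $\emptyset$ is the classical/universal empty set. $S_1(A) \equiv m_{\mathcal{M}^f(\Theta)}(A)$, $S_2(A)$, $S_3(A)$ are defined by

$$S_1(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \prod_{i=1}^{k} m_i(X_i) \tag{1.25}$$

$$S_2(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [\mathcal{U} = A] \vee [(\mathcal{U} \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \prod_{i=1}^{k} m_i(X_i) \tag{1.26}$$

$$S_3(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ (X_1 \cap X_2 \cap \ldots \cap X_k) \in \boldsymbol{\emptyset}}} \prod_{i=1}^{k} m_i(X_i) \tag{1.27}$$

with $\mathcal{U} \triangleq u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)$, where $u(X)$ is the union of all singletons $\theta_i$ that compose $X$, and $I_t \triangleq \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$ is the total ignorance. $S_1(A)$ corresponds to the classic DSm rule of combination for $k$ independent sources based on the free DSm model $\mathcal{M}^f(\Theta)$; $S_2(A)$ represents the mass of all relatively and absolutely empty sets which is transferred to the total or relative ignorances; $S_3(A)$ transfers the sum of relatively empty sets to the non-empty sets.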


The hybrid DSm rule of combination generalizes the classic DSm rule of combination and is not equivalent to Dempster's rule. It works for any model (the free DSm model, Shafer's model or any other hybrid model) when manipulating precise generalized (or eventually classical) basic belief functions. An extension of this rule for the combination of imprecise generalized (or eventually classical) basic belief functions is presented in a later chapter and is not reported in this presentation of DSmT.


1.3.6 On the refinement of the frames 


Let us clarify here the notion of refinement and its consequences with respect to DSmT and DST. The refinement of a set of overlapping hypotheses $\Theta = \{\theta_i, i = 1,\ldots,n\}$ consists in getting a new finer set of hypotheses $\{\theta'_i, i = 1,\ldots,n', n' > n\}$ such that we are sure that the $\theta'_i$ are truly exclusive and $\cup_{i=1}^{n} \theta_i \equiv \cup_{i=1}^{n'} \theta'_i$, i.e. $\Theta = \{\theta'_i, i = 1,\ldots,n' > n\}$. The DST starts with the notion of frame of discernment (finite set of exhaustive and exclusive hypotheses). The DST assumes therefore that a refinement exists to describe the fusion problem and is achievable, while DSmT does not make such an assumption at its starting point. The assumption of the existence of a refinement process appears to us as a very strong assumption which drastically reduces the domain of applicability of the DST, because the frames for most problems described in terms of natural language manipulating vague/continuous/relative concepts cannot be formally refined at all. Such an assumption is not fundamental and is relaxed in DSmT.


As a very simple but illustrative example, let's consider $\Theta$ defined as $\Theta = \{\theta_1 = \text{Small}, \theta_2 = \text{Tall}\}$. The notions of smallness ($\theta_1$) and tallness ($\theta_2$) cannot be interpreted in an absolute manner, since these notions are only defined with respect to some reference points chosen arbitrarily. Two independent sources of evidence (human "experts" here) can provide a different interpretation of $\theta_1$ and $\theta_2$ just because they usually do not share the same reference point. $\theta_1$ and $\theta_2$ actually represent fuzzy concepts carrying only a relative meaning. Moreover, these concepts are linked together by a continuous path.


Let's now examine a numerical example. Consider again the frame $\Theta = \{\theta_1 \triangleq \text{Small}, \theta_2 \triangleq \text{Tall}\}$ on the size of a person, with two independent witnesses providing the belief masses

$$m_1(\theta_1) = 0.4 \qquad m_1(\theta_2) = 0.5 \qquad m_1(\theta_1 \cup \theta_2) = 0.1$$

$$m_2(\theta_1) = 0.6 \qquad m_2(\theta_2) = 0.2 \qquad m_2(\theta_1 \cup \theta_2) = 0.2$$

If we admit that $\theta_1$ and $\theta_2$ cannot be precisely refined according to the previous justification, then the classic DSm rule of combination (denoted by index $DSmc$ here) yields:

$$m_{DSmc}(\emptyset) = 0 \quad m_{DSmc}(\theta_1) = 0.38 \quad m_{DSmc}(\theta_2) = 0.22 \quad m_{DSmc}(\theta_1 \cup \theta_2) = 0.02 \quad m_{DSmc}(\theta_1 \cap \theta_2) = 0.38$$
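For reference, each of these values follows from (1.23) by pooling the products of masses whose focal elements have the same intersection:

$$
\begin{aligned}
m_{DSmc}(\theta_1) &= m_1(\theta_1) m_2(\theta_1) + m_1(\theta_1) m_2(\theta_1 \cup \theta_2) + m_1(\theta_1 \cup \theta_2) m_2(\theta_1) = 0.24 + 0.08 + 0.06 = 0.38 \\
m_{DSmc}(\theta_2) &= m_1(\theta_2) m_2(\theta_2) + m_1(\theta_2) m_2(\theta_1 \cup \theta_2) + m_1(\theta_1 \cup \theta_2) m_2(\theta_2) = 0.10 + 0.10 + 0.02 = 0.22 \\
m_{DSmc}(\theta_1 \cup \theta_2) &= m_1(\theta_1 \cup \theta_2) m_2(\theta_1 \cup \theta_2) = 0.1 \cdot 0.2 = 0.02 \\
m_{DSmc}(\theta_1 \cap \theta_2) &= m_1(\theta_1) m_2(\theta_2) + m_1(\theta_2) m_2(\theta_1) = 0.08 + 0.30 = 0.38
\end{aligned}
$$

The four masses sum to one, and the mass that Dempster's rule would treat as conflict on Shafer's model is kept here on $\theta_1 \cap \theta_2$.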


Starting now with the same information, i.e. $m_1(.)$ and $m_2(.)$, we voluntarily assume that a refinement is possible (even if it does not actually make sense here) in order to compare the previous result with the result one would obtain with Dempster's rule of combination. So, let's assume the existence of a hypothetical refined frame of discernment $\Theta_{ref} = \{\theta'_1 = \text{Small'}, \theta'_2 = \text{Medium}, \theta'_3 = \text{Tall'}\}$ where $\theta'_1$, $\theta'_2$ and $\theta'_3$ correspond to some virtual exclusive hypotheses such that $\theta_1 = \theta'_1 \cup \theta'_2$, $\theta_2 = \theta'_2 \cup \theta'_3$ and $\theta_1 \cap \theta_2 = \theta'_2$, and where Small' and Tall' correspond respectively to a finer notion of smallness and tallness than in the original frame $\Theta$. Because we don't change the information we have available (that's all we have), the initial bba $m_1(.)$ and $m_2(.)$ expressed now on the virtual refined power set $2^{\Theta_{ref}}$ are given by

$$m'_1(\theta'_1 \cup \theta'_2) = 0.4 \qquad m'_1(\theta'_2 \cup \theta'_3) = 0.5 \qquad m'_1(\theta'_1 \cup \theta'_2 \cup \theta'_3) = 0.1$$

$$m'_2(\theta'_1 \cup \theta'_2) = 0.6 \qquad m'_2(\theta'_2 \cup \theta'_3) = 0.2 \qquad m'_2(\theta'_1 \cup \theta'_2 \cup \theta'_3) = 0.2$$


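Because $\Theta_{ref}$ is a refined frame, DST works and Dempster's rule applies. Because there is no positive mass for the conflicting terms $\theta'_1 \cap \theta'_2$, $\theta'_1 \cap \theta'_3$, $\theta'_2 \cap \theta'_3$ or $\theta'_1 \cap \theta'_2 \cap \theta'_3$, the degree of conflict reduces to $k_{12} = 0$ and the normalization factor involved in Dempster's rule is 1 in this refined example. One gets formally, where index $DS$ denotes here Dempster's rule, the following result:

$$
\begin{aligned}
m_{DS}(\emptyset) &= 0 \\
m_{DS}(\theta'_2) &= m'_1(\theta'_1 \cup \theta'_2)\, m'_2(\theta'_2 \cup \theta'_3) + m'_1(\theta'_2 \cup \theta'_3)\, m'_2(\theta'_1 \cup \theta'_2) \\
&= 0.4 \cdot 0.2 + 0.5 \cdot 0.6 = 0.38 \\
m_{DS}(\theta'_1 \cup \theta'_2) &= m'_1(\theta'_1 \cup \theta'_2)\, m'_2(\theta'_1 \cup \theta'_2) + m'_1(\theta'_1 \cup \theta'_2 \cup \theta'_3)\, m'_2(\theta'_1 \cup \theta'_2) + m'_2(\theta'_1 \cup \theta'_2 \cup \theta'_3)\, m'_1(\theta'_1 \cup \theta'_2) \\
&= 0.4 \cdot 0.6 + 0.1 \cdot 0.6 + 0.2 \cdot 0.4 = 0.38 \\
m_{DS}(\theta'_2 \cup \theta'_3) &= m'_1(\theta'_2 \cup \theta'_3)\, m'_2(\theta'_2 \cup \theta'_3) + m'_1(\theta'_1 \cup \theta'_2 \cup \theta'_3)\, m'_2(\theta'_2 \cup \theta'_3) + m'_2(\theta'_1 \cup \theta'_2 \cup \theta'_3)\, m'_1(\theta'_2 \cup \theta'_3) \\
&= 0.5 \cdot 0.2 + 0.1 \cdot 0.2 + 0.2 \cdot 0.5 = 0.22 \\
m_{DS}(\theta'_1 \cup \theta'_2 \cup \theta'_3) &= m'_1(\theta'_1 \cup \theta'_2 \cup \theta'_3)\, m'_2(\theta'_1 \cup \theta'_2 \cup \theta'_3) = 0.1 \cdot 0.2 = 0.02
\end{aligned}
$$

But since $\theta'_2 = \theta_1 \cap \theta_2$, $\theta'_1 \cup \theta'_2 = \theta_1$, $\theta'_2 \cup \theta'_3 = \theta_2$ and $\theta'_1 \cup \theta'_2 \cup \theta'_3 = \theta_1 \cup \theta_2$, one sees that Dempster's rule reduces to the classic DSm rule of combination, which means that the refinement of the frame $\Theta$ does not help to get a more specific (better) result from the DST when the inputs of the problem remain the same. Actually, working on $\Theta_{ref}$ with DST does not bring a difference with DSmT, but just brings a useless complexity in derivations. Note that the hybrid DSm rule of combination can also be applied on Shafer's model associated with $\Theta_{ref}$, but it naturally provides the same result as with the classic DSm rule in this case.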
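This equivalence is easy to check numerically. The short Python sketch below is an illustration added for that purpose, not code from the original text: it applies Dempster's rule to the two bba re-expressed on the refined frame, with focal elements stored as `frozenset` of the (string) labels of the refined hypotheses; the function and variable names are hypothetical.

```python
def dempster(m1, m2):
    """Dempster's rule on Shafer's model; returns the fused bba and the conflict k12."""
    joint = {}
    for x, mx in m1.items():
        for y, my in m2.items():
            z = x & y
            joint[z] = joint.get(z, 0.0) + mx * my
    k12 = joint.pop(frozenset(), 0.0)
    return {a: v / (1.0 - k12) for a, v in joint.items()}, k12

S, M, T = "Small'", "Medium", "Tall'"        # refined, exclusive hypotheses
m1 = {frozenset({S, M}): 0.4, frozenset({M, T}): 0.5, frozenset({S, M, T}): 0.1}
m2 = {frozenset({S, M}): 0.6, frozenset({M, T}): 0.2, frozenset({S, M, T}): 0.2}

fused, k12 = dempster(m1, m2)
```

Every pair of focal elements shares the hypothesis Medium, so `k12` is zero and `fused` carries 0.38, 0.38, 0.22 and 0.02 on Medium, Small' ∪ Medium, Medium ∪ Tall' and the whole refined frame respectively.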


If the inputs of the problem are now changed by re-asking (assuming that such a process is possible) the sources to provide their revised belief assignments directly on $\Theta_{ref}$, with $m'_i(\theta'_1) > 0$, $m'_i(\theta'_2) > 0$ and $m'_i(\theta'_3) > 0$ ($i = 1, 2$) rather than on $\Theta$, then the hybrid DSm rule of combination will be applied instead of Dempster's rule when adopting the DSmT. The fusion results will then differ, which is normal since the hybrid DSm rule is not equivalent to Dempster's rule, except when the conflict is zero.


1.3.7 On the combination of sources over different frames 


In some fusion problems, it can happen that sources provide their basic belief assignments over distinct frames (which can moreover sometimes partially overlap). As a simple example, let's consider two equally reliable sources of evidence $\mathcal{B}_1$ and $\mathcal{B}_2$ providing their belief assignments respectively on the distinct frames $\Theta_1$ and $\Theta_2$ defined as follows

$$\Theta_1 = \{P \triangleq \text{Plane}, H \triangleq \text{Helicopter}, M \triangleq \text{Missile}\}$$

$$\Theta_2 = \{S \triangleq \text{Slow motion}, F \triangleq \text{Fast motion}\}$$

In other words, $m_1(.)$ associated with $\mathcal{B}_1$ is defined either on $D^{\Theta_1}$ or $2^{\Theta_1}$ (if Shafer's model is assumed to hold) while $m_2(.)$ associated with $\mathcal{B}_2$ is defined either on $D^{\Theta_2}$ or $2^{\Theta_2}$. The problem relates here to the combination of $m_1(.)$ with $m_2(.)$.


The basic solution of this problem consists in working on the global frame⁸ $\Theta = \{\Theta_1, \Theta_2\}$ and in following the deconditioning method proposed by Smets, based on the principle of minimum specificity, to revise the basic belief assignments $m_1(.)$ and $m_2(.)$ on $\Theta$. When additional information on compatibility links between elements of $\Theta_1$ and $\Theta_2$ is known, then the refined method proposed by Janez in [21] is preferred. Once the proper model $\mathcal{M}(\Theta)$ for $\Theta$ has been chosen to fit with the true nature of hypotheses and the revised bba $m_1^{rev}(.)$ and $m_2^{rev}(.)$ defined on $D^\Theta$ are obtained, the fusion of belief assignments is performed with the hybrid DSm rule of combination.

⁸ With suppression of possible redundant elements when $\Theta_1$ and $\Theta_2$ overlap partially.

1.4 Comparison of different rules of combinations


1.4.1 First example 


In this section, we compare the results provided by the most common rules of combination on the following very simple numerical example, where only 2 independent sources (a priori assumed equally reliable) are involved and provide their beliefs initially on the 3D frame $\Theta = \{\theta_1, \theta_2, \theta_3\}$. It is assumed in this example that Shafer's model holds and thus the belief assignments $m_1(.)$ and $m_2(.)$ do not commit belief to internal conflicting information. $m_1(.)$ and $m_2(.)$ are chosen as follows:

$$m_1(\theta_1) = 0.1 \qquad m_1(\theta_2) = 0.4 \qquad m_1(\theta_3) = 0.2 \qquad m_1(\theta_1 \cup \theta_2) = 0.3$$

$$m_2(\theta_1) = 0.5 \qquad m_2(\theta_2) = 0.1 \qquad m_2(\theta_3) = 0.3 \qquad m_2(\theta_1 \cup \theta_2) = 0.1$$

These belief masses are usually represented in the form of a belief mass matrix $\mathbf{M}$ given by

$$\mathbf{M} = \begin{bmatrix} 0.1 & 0.4 & 0.2 & 0.3 \\ 0.5 & 0.1 & 0.3 & 0.1 \end{bmatrix} \tag{1.28}$$

where index $i$ for the rows corresponds to the index of the source no. $i$ and the indexes $j$ for the columns of $\mathbf{M}$ correspond to a given choice for enumerating the focal elements of all sources. In this particular example, index $j = 1$ corresponds to $\theta_1$, $j = 2$ corresponds to $\theta_2$, $j = 3$ corresponds to $\theta_3$ and $j = 4$ corresponds to $\theta_1 \cup \theta_2$.


Now let's imagine that one finds out that $\theta_3$ is actually truly empty because some extra and certain knowledge on $\theta_3$ is received by the fusion center. As an example, $\theta_1$, $\theta_2$ and $\theta_3$ may correspond to three suspects (potential murderers) in a police investigation, $m_1(.)$ and $m_2(.)$ correspond to two reports of independent witnesses, but it turns out that $\theta_3$ has finally provided a strong alibi to the criminal police investigator once arrested by the policemen. This situation corresponds to setting up a hybrid model $\mathcal{M}$ with the constraint $\theta_3 = \emptyset$ (see chapter 4 for a detailed presentation on hybrid models).

Let's examine the result of the fusion obtained in such a situation by Smets', Yager's, Dubois & Prade's and the hybrid DSm rules of combination. First note that, based on the free DSm model, one would get by applying the classic DSm rule (denoted here by index DSmc) the following fusion result

$$m_{DSmc}(\theta_1) = 0.21 \qquad m_{DSmc}(\theta_2) = 0.11 \qquad m_{DSmc}(\theta_3) = 0.06 \qquad m_{DSmc}(\theta_1 \cup \theta_2) = 0.03$$
$$m_{DSmc}(\theta_1 \cap \theta_2) = 0.21 \qquad m_{DSmc}(\theta_1 \cap \theta_3) = 0.13 \qquad m_{DSmc}(\theta_2 \cap \theta_3) = 0.14$$
$$m_{DSmc}(\theta_3 \cap (\theta_1 \cup \theta_2)) = 0.11$$

But because of the exclusivity constraints (imposed here by the use of Shafer's model and by the non-existential constraint $\theta_3 = \emptyset$), the total conflicting mass is actually given by

$$k_{12} = 0.06 + 0.21 + 0.13 + 0.14 + 0.11 = 0.65 \quad \text{(conflicting mass)}$$
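The classic DSm result and the conflicting mass $k_{12}$ can be checked numerically. The sketch below is only an illustration: it assumes one possible encoding of the free model, in which each $\theta_i$ is represented by the set of Venn-diagram regions it covers, so that $\cup$ and $\cap$ become ordinary set operations. The names `REGIONS`, `theta`, `classic_dsm` and `ALLOWED` are hypothetical and do not come from the text.

```python
from itertools import combinations
from collections import defaultdict

N = 3
# Venn regions of the free model: every non-empty subset of {1, 2, 3}.
REGIONS = [frozenset(c) for r in range(1, N + 1)
           for c in combinations(range(1, N + 1), r)]

def theta(i):
    """Encode theta_i as the set of Venn regions it covers (assumed encoding)."""
    return frozenset(r for r in REGIONS if i in r)

t1, t2, t3 = theta(1), theta(2), theta(3)

m1 = {t1: 0.1, t2: 0.4, t3: 0.2, t1 | t2: 0.3}
m2 = {t1: 0.5, t2: 0.1, t3: 0.3, t1 | t2: 0.1}

def classic_dsm(ma, mb):
    """Conjunctive consensus on the free model: each product goes to X1 ∩ X2."""
    out = defaultdict(float)
    for x1, w1 in ma.items():
        for x2, w2 in mb.items():
            out[x1 & x2] += w1 * w2
    return dict(out)

m_dsmc = classic_dsm(m1, m2)

# Shafer's model plus theta_3 = ∅: only the regions {1} and {2} survive.
ALLOWED = {frozenset({1}), frozenset({2})}

# The conflicting mass is carried by every element with no surviving region.
k12 = sum(w for x, w in m_dsmc.items() if not (x & ALLOWED))

for element, mass in m_dsmc.items():
    print(sorted(map(sorted, element)), round(mass, 2))
print("k12 =", round(k12, 2))
```

Under this encoding an integrity constraint simply removes regions, which is why the conflict can be read off as the mass of the elements left without any region.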
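• If one applies the disjunctive rule (1.6), one gets:

$$m_\cup(\emptyset) = 0$$
$$m_\cup(\theta_1) = m_1(\theta_1)m_2(\theta_1) = 0.1 \cdot 0.5 = 0.05$$
$$m_\cup(\theta_2) = m_1(\theta_2)m_2(\theta_2) = 0.4 \cdot 0.1 = 0.04$$
$$m_\cup(\theta_3) = m_1(\theta_3)m_2(\theta_3) = 0.2 \cdot 0.3 = 0.06$$
$$m_\cup(\theta_1 \cup \theta_2) = [m_1(\theta_1 \cup \theta_2)m_2(\theta_1 \cup \theta_2)] + [m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2)]$$
$$\qquad + [m_1(\theta_1)m_2(\theta_1 \cup \theta_2) + m_2(\theta_1)m_1(\theta_1 \cup \theta_2)]$$
$$\qquad + [m_1(\theta_2)m_2(\theta_1 \cup \theta_2) + m_2(\theta_2)m_1(\theta_1 \cup \theta_2)]$$
$$\qquad = [0.3 \cdot 0.1] + [0.01 + 0.20] + [0.01 + 0.15] + [0.04 + 0.03]$$
$$\qquad = 0.03 + 0.21 + 0.16 + 0.07 = 0.47$$
$$m_\cup(\theta_1 \cup \theta_3) = m_1(\theta_1)m_2(\theta_3) + m_2(\theta_1)m_1(\theta_3) = 0.03 + 0.10 = 0.13$$
$$m_\cup(\theta_2 \cup \theta_3) = m_1(\theta_2)m_2(\theta_3) + m_2(\theta_2)m_1(\theta_3) = 0.12 + 0.02 = 0.14$$
$$m_\cup(\theta_1 \cup \theta_2 \cup \theta_3) = m_1(\theta_3)m_2(\theta_1 \cup \theta_2) + m_2(\theta_3)m_1(\theta_1 \cup \theta_2) = 0.02 + 0.09 = 0.11$$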


• If one applies the hybrid DSm rule (1.24) (denoted here by index DSmh) for 2 sources ($k = 2$), one gets:

$$m_{DSmh}(\emptyset) = 0$$
$$m_{DSmh}(\theta_1) = 0.21 + 0.13 = 0.34$$
$$m_{DSmh}(\theta_2) = 0.11 + 0.14 = 0.25$$
$$m_{DSmh}(\theta_1 \cup \theta_2) = 0.03 + [0.2 \cdot 0.1 + 0.3 \cdot 0.3] + [0.1 \cdot 0.1 + 0.5 \cdot 0.4] + [0.2 \cdot 0.3] = 0.41$$


• If one applies Smets' rule (1.8), one gets:

$$m_S(\emptyset) = m(\emptyset) = 0.65 \quad \text{(conflicting mass)}$$
$$m_S(\theta_1) = 0.21$$
$$m_S(\theta_2) = 0.11$$
$$m_S(\theta_1 \cup \theta_2) = 0.03$$

• If one applies Yager's rule (1.9), one gets:

$$m_Y(\emptyset) = 0$$
$$m_Y(\theta_1) = 0.21$$
$$m_Y(\theta_2) = 0.11$$
$$m_Y(\theta_1 \cup \theta_2) = 0.03 + k_{12} = 0.03 + 0.65 = 0.68$$

• If one applies Dempster's rule (1.4) (denoted here by index DS), one gets:

$$m_{DS}(\emptyset) = 0$$
$$m_{DS}(\theta_1) = 0.21/[1 - k_{12}] = 0.21/[1 - 0.65] = 0.21/0.35 = 0.600000$$
$$m_{DS}(\theta_2) = 0.11/[1 - k_{12}] = 0.11/[1 - 0.65] = 0.11/0.35 = 0.314286$$
$$m_{DS}(\theta_1 \cup \theta_2) = 0.03/[1 - k_{12}] = 0.03/[1 - 0.65] = 0.03/0.35 = 0.085714$$


• If one applies Murphy's rule (1.7), i.e. the average of masses, one gets:

$$m_M(\emptyset) = (0 + 0)/2 = 0$$
$$m_M(\theta_1) = (0.1 + 0.5)/2 = 0.30$$
$$m_M(\theta_2) = (0.4 + 0.1)/2 = 0.25$$
$$m_M(\theta_3) = (0.2 + 0.3)/2 = 0.25$$
$$m_M(\theta_1 \cup \theta_2) = (0.3 + 0.1)/2 = 0.20$$

But if one finds out with certainty that $\theta_3 = \emptyset$, where does $m_M(\theta_3) = 0.25$ go to? Either one accepts here that $m_M(\theta_3)$ goes to $m_M(\theta_1 \cup \theta_2)$ as in Yager's rule, or $m_M(\theta_3)$ goes to $m_M(\emptyset)$ as in Smets' rule. Catherine Murphy does not provide a solution for such a case in her paper [27].
• If one applies Dubois & Prade's rule (1.10), one gets, because $\theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$:

$$m_{DP}(\emptyset) = 0 \quad \text{(by definition of Dubois \& Prade's rule)}$$
$$m_{DP}(\theta_1) = [m_1(\theta_1)m_2(\theta_1) + m_1(\theta_1)m_2(\theta_1 \cup \theta_2) + m_2(\theta_1)m_1(\theta_1 \cup \theta_2)] + [m_1(\theta_1)m_2(\theta_3) + m_2(\theta_1)m_1(\theta_3)]$$
$$\qquad = [0.1 \cdot 0.5 + 0.1 \cdot 0.1 + 0.5 \cdot 0.3] + [0.1 \cdot 0.3 + 0.5 \cdot 0.2] = 0.21 + 0.13 = 0.34$$
$$m_{DP}(\theta_2) = [0.4 \cdot 0.1 + 0.4 \cdot 0.1 + 0.1 \cdot 0.3] + [0.4 \cdot 0.3 + 0.1 \cdot 0.2] = 0.11 + 0.14 = 0.25$$
$$m_{DP}(\theta_1 \cup \theta_2) = [m_1(\theta_1 \cup \theta_2)m_2(\theta_1 \cup \theta_2)] + [m_1(\theta_1 \cup \theta_2)m_2(\theta_3) + m_2(\theta_1 \cup \theta_2)m_1(\theta_3)] + [m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2)]$$
$$\qquad = [0.3 \cdot 0.1] + [0.3 \cdot 0.3 + 0.1 \cdot 0.2] + [0.1 \cdot 0.1 + 0.5 \cdot 0.4] = [0.03] + [0.09 + 0.02] + [0.01 + 0.20]$$
$$\qquad = 0.03 + 0.11 + 0.21 = 0.35$$

Now if one adds up the masses, one gets $0 + 0.34 + 0.25 + 0.35 = 0.94$ which is less than 1. Therefore Dubois & Prade's rule of combination does not work when a singleton, or a union of singletons, becomes empty (in a dynamic fusion problem). The products of such empty-element columns of the mass matrix $\mathbf{M}$ are lost; this problem is fixed in DSmT by the sum $S_2(.)$ in (1.24) which transfers these products to the total or partial ignorances.
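The Smets, Yager and Dempster figures listed above all start from the same conjunctive products and differ only in what is done with the conflicting mass $k_{12}$. The following sketch illustrates this under Shafer's model with $\theta_3$ declared empty. Focal elements are plain sets of indices, and the names `conjunctive`, `EMPTY_ELEMS` and `IGNORANCE` are assumptions made for the example.

```python
from collections import defaultdict

THETA = frozenset({1, 2, 3})
EMPTY_ELEMS = frozenset({3})        # theta_3 has been found to be empty
IGNORANCE = THETA - EMPTY_ELEMS     # theta_1 ∪ theta_2

m1 = {frozenset({1}): 0.1, frozenset({2}): 0.4,
      frozenset({3}): 0.2, frozenset({1, 2}): 0.3}
m2 = {frozenset({1}): 0.5, frozenset({2}): 0.1,
      frozenset({3}): 0.3, frozenset({1, 2}): 0.1}

def conjunctive(ma, mb):
    """Products go to X1 ∩ X2, with the empty element theta_3 removed."""
    out = defaultdict(float)
    for x1, w1 in ma.items():
        for x2, w2 in mb.items():
            out[(x1 & x2) - EMPTY_ELEMS] += w1 * w2
    return dict(out)

conj = conjunctive(m1, m2)
k12 = conj.pop(frozenset(), 0.0)    # total conflicting mass

# Smets: the conflict stays on the empty set.
smets = dict(conj)
smets[frozenset()] = k12

# Yager: the conflict is transferred to the total ignorance.
yager = dict(conj)
yager[IGNORANCE] = yager.get(IGNORANCE, 0.0) + k12

# Dempster: the conflict is removed by normalization (undefined when k12 == 1).
dempster = {x: w / (1.0 - k12) for x, w in conj.items()}

print("k12 =", round(k12, 2))
print("Yager    m(theta1 ∪ theta2) =", round(yager[IGNORANCE], 2))
print("Dempster m(theta1) =", round(dempster[frozenset({1})], 6))
```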


In this particular example, using the hybrid DSm rule, one transfers the product of the empty-element $\theta_3$ column, $m_1(\theta_3)m_2(\theta_3) = 0.2 \cdot 0.3 = 0.06$, to $m_{DSmh}(\theta_1 \cup \theta_2)$, which becomes equal to $0.35 + 0.06 = 0.41$.
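The transfer described above can be reproduced with a few lines of code. This is a simplified sketch restricted to two sources under Shafer's model, where focal elements are subsets of the frame; it is not the general rule (1.24). The function name `combine` and the flag `transfer_lost` are hypothetical: with the flag off, the product of two empty elements is dropped, as happens with Dubois & Prade's rule in this example, and with the flag on it is sent to the total ignorance.

```python
from collections import defaultdict

def combine(ma, mb, theta, empty, transfer_lost=True):
    """Two-source combination on subsets of `theta`, with `empty` forced to ∅."""
    live = theta - empty
    out = defaultdict(float)
    for x1, w1 in ma.items():
        for x2, w2 in mb.items():
            target = (x1 & x2) & live          # conjunctive part
            if not target:
                target = (x1 | x2) & live      # conflict goes to the union
            if not target:
                if not transfer_lost:
                    continue                   # product is lost
                target = live                  # product goes to total ignorance
            out[target] += w1 * w2
    return dict(out)

THETA = frozenset({1, 2, 3})
EMPTY = frozenset({3})

m1 = {frozenset({1}): 0.1, frozenset({2}): 0.4,
      frozenset({3}): 0.2, frozenset({1, 2}): 0.3}
m2 = {frozenset({1}): 0.5, frozenset({2}): 0.1,
      frozenset({3}): 0.3, frozenset({1, 2}): 0.1}

dp = combine(m1, m2, THETA, EMPTY, transfer_lost=False)
dsmh = combine(m1, m2, THETA, EMPTY, transfer_lost=True)

print("without transfer:", round(sum(dp.values()), 2))
print("with transfer:   ", round(sum(dsmh.values()), 2))
```

The two results differ only on $\theta_1 \cup \theta_2$, by exactly the lost product.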


In conclusion, DSmT is a natural extension of DST and of Yager's, Smets' and Dubois & Prade's approaches. When no singleton nor union of singletons is empty, DSmT is consistent with Dubois & Prade's approach and gives the same results (because the sum $S_2(.)$ is not used in this case in the hybrid DSm rule of combination). Otherwise, Dubois & Prade's rule of combination does not work (giving a sum of fused masses less than 1) for dynamic fusion problems involving non-existential constraints. Murphy's rule does not work either in this case because the masses of empty sets are not transferred. If the conflict $k_{12}$ is total (i.e. $k_{12} = 1$), DST does not work at all (one gets 0/0 in Dempster's rule of combination), while Smets' rule gives $m_S(\emptyset) = 1$ which is, in our opinion and for the reasons explained in this introduction and in chapter D], not necessarily justified. When the conflict is total, the DSm rule is consistent with Yager's and Dubois & Prade's rules.

The general hybrid DSm rule of combination works on any model for solving static and dynamic fusion problems and is designed for all kinds of conflict: $0 \leq m(\text{conflict}) \leq 1$. When the conflict converges towards zero, all rules (Dempster's, Yager's, Smets', Murphy's, Dubois & Prade's, DSmT) converge towards the same result. This fact is important because it shows the connection among all of them. But if the conflict converges towards 1, the results of these rules diverge more and more, up to the point where some rules do not work at all (Dempster's rule). Murphy's rule is the only one which is idempotent (being the average of masses). Dubois & Prade's rule does not work in the Smets' case (when $m(\emptyset) > 0$). For models with all intersections empty (Shafer's model) and conflict 1, Dempster's rule is not defined. See the example below on $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$ with all $\theta_i$, $i = 1, 2, 3, 4$ exclusive:

$$m_1(\theta_1) = 0.3 \qquad m_1(\theta_2) = 0 \qquad m_1(\theta_3) = 0.7 \qquad m_1(\theta_4) = 0$$
$$m_2(\theta_1) = 0 \qquad m_2(\theta_2) = 0.6 \qquad m_2(\theta_3) = 0 \qquad m_2(\theta_4) = 0.4$$

Using Dempster's rule, one gets 0/0, undefined. The conflicting mass is 1.

Yager's rule provides in this case $m_Y(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = 1$ which does not bring specific information, while Smets' rule gives $m(\emptyset) = 1$ which is also not very useful. Murphy's rule gives $m_M(\theta_1) = 0.15$, $m_M(\theta_2) = 0.30$, $m_M(\theta_3) = 0.35$ and $m_M(\theta_4) = 0.20$ which is very specific, while the hybrid DSm rule provides $m_{DSmh}(\theta_1 \cup \theta_2) = 0.18$, $m_{DSmh}(\theta_1 \cup \theta_4) = 0.12$, $m_{DSmh}(\theta_2 \cup \theta_3) = 0.42$ and $m_{DSmh}(\theta_3 \cup \theta_4) = 0.28$ which is less specific than Murphy's result but characterizes adequately the internal conflict between sources after the combination and the partial ignorances.

The disjunctive rule gives in this last example $m_\cup(\theta_1 \cup \theta_2) = m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2) = 0.18$. Similarly, one gets $m_\cup(\theta_1 \cup \theta_4) = 0.12$, $m_\cup(\theta_2 \cup \theta_3) = 0.42$ and $m_\cup(\theta_3 \cup \theta_4) = 0.28$. This coincides with the hybrid DSm rule when all intersections are empty.
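The figures quoted for this total-conflict example can be checked with the short sketch below. It takes $m_1(\theta_1) = 0.3$, the value for which $m_1(.)$ sums to one and which reproduces the quoted results. The helper names `disjunctive`, `murphy` and `conflict` are illustrative assumptions, not notation from the text.

```python
from collections import defaultdict

m1 = {frozenset({1}): 0.3, frozenset({3}): 0.7}
m2 = {frozenset({2}): 0.6, frozenset({4}): 0.4}

def disjunctive(ma, mb):
    """Disjunctive rule: each product m_a(X1) m_b(X2) goes to X1 ∪ X2."""
    out = defaultdict(float)
    for x1, w1 in ma.items():
        for x2, w2 in mb.items():
            out[x1 | x2] += w1 * w2
    return dict(out)

def murphy(ma, mb):
    """Murphy's rule: plain average of the two mass functions."""
    keys = set(ma) | set(mb)
    return {x: (ma.get(x, 0.0) + mb.get(x, 0.0)) / 2 for x in keys}

def conflict(ma, mb):
    """Mass of all pairs of focal elements with an empty intersection."""
    return sum(w1 * w2
               for x1, w1 in ma.items()
               for x2, w2 in mb.items()
               if not (x1 & x2))

m_union = disjunctive(m1, m2)
m_avg = murphy(m1, m2)

print("conflict =", round(conflict(m1, m2), 2))
for x, w in sorted(m_union.items(), key=lambda kv: sorted(kv[0])):
    print("disjunctive", sorted(x), round(w, 2))
for x, w in sorted(m_avg.items(), key=lambda kv: sorted(kv[0])):
    print("Murphy     ", sorted(x), round(w, 2))
```

Averaging a mass function with itself returns the same function, which is the idempotence property mentioned above for Murphy's rule.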


1.4.2 Second example 


This example is an extension of Zadeh's example discussed in chapter[B] Let's consider two independent sources of evidence over the frame $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$ and assume that Shafer's model holds. The basic belief assignments are chosen as follows:

$$m_1(\theta_1) = 0.998 \qquad m_1(\theta_2) = 0 \qquad m_1(\theta_3) = 0.001 \qquad m_1(\theta_4) = 0.001$$
$$m_2(\theta_1) = 0 \qquad m_2(\theta_2) = 0.998 \qquad m_2(\theta_3) = 0 \qquad m_2(\theta_4) = 0.002$$

In this simple numerical example, Dempster's rule of combination gives the counter-intuitive result

$$m_{DS}(\theta_4) = \frac{0.001 \cdot 0.002}{1 - [0.998 \cdot 0.998 + 0.998 \cdot 0.002 + 0.998 \cdot 0.001 + 0.998 \cdot 0.001 + 0.001 \cdot 0.002]} = \frac{0.000002}{0.000002} = 1$$

Yager's rule gives $m_Y(\theta_4) = 0.000002$ and $m_Y(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = 0.999998$.

Smets' rule gives $m_S(\theta_4) = 0.000002$ and $m_S(\emptyset) = 0.999998$.

Murphy's rule gives $m_M(\theta_1) = 0.499$, $m_M(\theta_2) = 0.499$, $m_M(\theta_3) = 0.0005$ and $m_M(\theta_4) = 0.0015$.

Dubois & Prade's rule gives $m_{DP}(\theta_4) = 0.000002$, $m_{DP}(\theta_1 \cup \theta_2) = 0.996004$, $m_{DP}(\theta_1 \cup \theta_4) = 0.001996$, $m_{DP}(\theta_2 \cup \theta_3) = 0.000998$, $m_{DP}(\theta_2 \cup \theta_4) = 0.000998$ and $m_{DP}(\theta_3 \cup \theta_4) = 0.000002$. Dubois & Prade's rule works only in Shafer's model $\mathcal{M}^0(\Theta)$, i.e. when all intersections are empty. For other hybrid models, Dubois & Prade's rule of combination fails to provide a reliable and reasonable solution to the combination of sources (see next example).

The classic DSm rule of combination provides $m_{DSmc}(\theta_4) = 0.000002$, $m_{DSmc}(\theta_1 \cap \theta_2) = 0.996004$, $m_{DSmc}(\theta_1 \cap \theta_4) = 0.001996$, $m_{DSmc}(\theta_2 \cap \theta_3) = 0.000998$, $m_{DSmc}(\theta_2 \cap \theta_4) = 0.000998$ and $m_{DSmc}(\theta_3 \cap \theta_4) = 0.000002$. If one now applies the hybrid DSm rule, since one assumes here that Shafer's model holds, one gets the same result as Dubois & Prade's. The disjunctive rule coincides with Dubois & Prade's rule and the hybrid DSm rule when all intersections are empty.


1.4.3 Third example 


Here is an example for the Smets' case (i.e. TBM), when $m(\emptyset) > 0$, where Dubois & Prade's rule of combination does not work. Let's consider the following extended$^9$ belief assignments

$$m_1(\emptyset) = 0.2 \qquad m_1(\theta_1) = 0.4 \qquad m_1(\theta_2) = 0.4$$
$$m_2(\emptyset) = 0.3 \qquad m_2(\theta_1) = 0.6 \qquad m_2(\theta_2) = 0.1$$

In this specific case, Dubois & Prade's rule of combination gives (assuming all intersections empty)

$$m_{DP}(\emptyset) = 0 \quad \text{(by definition)}$$
$$m_{DP}(\theta_1) = m_1(\theta_1)m_2(\theta_1) + [m_1(\emptyset)m_2(\theta_1) + m_2(\emptyset)m_1(\theta_1)] = 0.24 + [0.12 + 0.12] = 0.48$$
$$m_{DP}(\theta_2) = m_1(\theta_2)m_2(\theta_2) + [m_1(\emptyset)m_2(\theta_2) + m_2(\emptyset)m_1(\theta_2)] = 0.04 + [0.02 + 0.12] = 0.18$$
$$m_{DP}(\theta_1 \cup \theta_2) = m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2) = 0.04 + 0.24 = 0.28$$

The sum of masses is $0.48 + 0.18 + 0.28 = 0.94 < 1$. Where does the mass $m_1(\emptyset)m_2(\emptyset) = 0.2 \cdot 0.3 = 0.06$ go? When using the hybrid DSm rule of combination, one gets $m_{DSmh}(\emptyset) = 0$, $m_{DSmh}(\theta_1) = 0.48$, $m_{DSmh}(\theta_2) = 0.18$ and

$$m_{DSmh}(\theta_1 \cup \theta_2) = [m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2)] + [m_1(\emptyset)m_2(\emptyset)] = [0.28] + [0.2 \cdot 0.3] = 0.34$$

and the masses add up to 1.

The disjunctive rule gives in this example

$$m_\cup(\theta_1) = m_1(\theta_1)m_2(\theta_1) + [m_1(\emptyset)m_2(\theta_1) + m_2(\emptyset)m_1(\theta_1)] = 0.24 + [0.12 + 0.12] = 0.48$$
$$m_\cup(\theta_2) = m_1(\theta_2)m_2(\theta_2) + [m_1(\emptyset)m_2(\theta_2) + m_2(\emptyset)m_1(\theta_2)] = 0.04 + [0.02 + 0.12] = 0.18$$
$$m_\cup(\theta_1 \cup \theta_2) = m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2) = 0.04 + 0.24 = 0.28$$
$$m_\cup(\emptyset) = m_1(\emptyset)m_2(\emptyset) = 0.06 > 0$$

One gets the same results for $m_\cup(\theta_1)$, $m_\cup(\theta_2)$ as with Dubois & Prade's rule and as with the hybrid DSm rule. The distinction is in the reallocation of the empty mass $m_1(\emptyset)m_2(\emptyset) = 0.06$ to $\theta_1 \cup \theta_2$ in the hybrid DSm rule, while in Dubois & Prade's and disjunctive rules it is not reallocated.
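For this example the three rules route every product of masses identically except the $\emptyset \times \emptyset$ one, because all focal elements are singletons or the empty set. The sketch below makes that single difference explicit through a parameter. The function `combine` and its argument `empty_pair_target` are hypothetical names introduced for the illustration only.

```python
from collections import defaultdict

E = frozenset()
T1, T2 = frozenset({1}), frozenset({2})

m1 = {E: 0.2, T1: 0.4, T2: 0.4}
m2 = {E: 0.3, T1: 0.6, T2: 0.1}

def combine(ma, mb, empty_pair_target):
    """Send each product to X1 ∩ X2 if non-empty, otherwise to X1 ∪ X2.

    The (∅, ∅) product goes to `empty_pair_target`; None means it is discarded.
    """
    out = defaultdict(float)
    for x1, w1 in ma.items():
        for x2, w2 in mb.items():
            target = (x1 & x2) or (x1 | x2)
            if not target:                      # both focal elements are ∅
                if empty_pair_target is None:
                    continue
                target = empty_pair_target
            out[target] += w1 * w2
    return dict(out)

m_dp = combine(m1, m2, None)        # product m1(∅)m2(∅) is lost
m_dsmh = combine(m1, m2, T1 | T2)   # product is sent to theta1 ∪ theta2
m_union = combine(m1, m2, E)        # product stays on ∅

print(round(sum(m_dp.values()), 2), round(sum(m_dsmh.values()), 2))
```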


9 We mean here non-normalized masses allowing weight of evidence on the empty set as in the TBM of Smets.




A major difference between the hybrid DSm rule and all other combination rules is that DSmT uses from the beginning a hyper-power set, which includes intersections, while other combination rules need to do a refinement in order to get intersections.


1.4.4 Fourth example 


Here is another example where Dempster's rule does not work properly (this is different from Zadeh's example). Let's consider $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$ and assume that Shafer's model holds. The basic belief assignments are now chosen as follows:

$$m_1(\theta_1) = 0.99 \qquad m_1(\theta_2) = 0 \qquad m_1(\theta_3 \cup \theta_4) = 0.01$$
$$m_2(\theta_1) = 0 \qquad m_2(\theta_2) = 0.98 \qquad m_2(\theta_3 \cup \theta_4) = 0.02$$

Applying Dempster's rule, one gets $m_{DS}(\theta_1) = m_{DS}(\theta_2) = 0$ and

$$m_{DS}(\theta_3 \cup \theta_4) = \frac{0.01 \cdot 0.02}{1 - [0.99 \cdot 0.98 + 0.99 \cdot 0.02 + 0.98 \cdot 0.01]} = \frac{0.0002}{1 - 0.9998} = \frac{0.0002}{0.0002} = 1$$

which is abnormal.

The hybrid DSm rule gives $m_{DSmh}(\theta_1 \cup \theta_2) = 0.99 \cdot 0.98 = 0.9702$, $m_{DSmh}(\theta_1 \cup \theta_3 \cup \theta_4) = 0.0198$, $m_{DSmh}(\theta_2 \cup \theta_3 \cup \theta_4) = 0.0098$ and $m_{DSmh}(\theta_3 \cup \theta_4) = 0.0002$. In this case, Dubois & Prade's rule gives the same results as the hybrid DSm rule. The disjunctive rule provides a combined belief assignment $m_\cup(.)$ which is the same as $m_{DSmh}(.)$ and $m_{DP}(.)$.

Yager's rule gives $m_Y(\theta_3 \cup \theta_4) = 0.0002$, $m_Y(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = 0.9998$ and Smets' rule gives $m_S(\theta_3 \cup \theta_4) = 0.0002$, $m_S(\emptyset) = 0.9998$. Both Yager's and Smets' results are less specific than the result obtained with the hybrid DSm rule. Some information is lost when using Yager's or Smets' rules.


1.4.5 Fifth example 


Suppose one extends Dubois & Prade's rule from the power set $2^\Theta$ to the hyper-power set $D^\Theta$. It can be shown that Dubois & Prade's rule does not work (because the $S_2(.)$ term is missing) when:

a) at least one singleton is empty and the elements of its column are all non zero,

b) at least one union of singletons is empty and the elements of its column are all non zero,

c) or at least one intersection is empty and the elements of its column are non zero.

Here is an example with intersection (Dubois & Prade's rule extended to the hyper-power set). Let's consider two independent sources on $\Theta = \{\theta_1, \theta_2\}$ with

$$m_1(\theta_1) = 0.5 \qquad m_1(\theta_2) = 0.1 \qquad m_1(\theta_1 \cap \theta_2) = 0.4$$
$$m_2(\theta_1) = 0.1 \qquad m_2(\theta_2) = 0.6 \qquad m_2(\theta_1 \cap \theta_2) = 0.3$$

Then the extended Dubois & Prade rule on the hyper-power set gives $m_{DP}(\emptyset) = 0$, $m_{DP}(\theta_1) = 0.05$, $m_{DP}(\theta_2) = 0.06$, $m_{DP}(\theta_1 \cap \theta_2) = 0.4 \cdot 0.3 + 0.5 \cdot 0.6 + 0.1 \cdot 0.1 + 0.5 \cdot 0.3 + 0.1 \cdot 0.4 + 0.1 \cdot 0.3 + 0.6 \cdot 0.4 = 0.89$. Now suppose one finds out that $\theta_1 \cap \theta_2 = \emptyset$, then the revised masses become

$$m'_{DP}(\emptyset) = 0 \quad \text{(by definition)}$$
$$m'_{DP}(\theta_1) = 0.05 + [m_1(\theta_1)m_2(\theta_1 \cap \theta_2) + m_2(\theta_1)m_1(\theta_1 \cap \theta_2)] = 0.05 + [0.5 \cdot 0.3 + 0.1 \cdot 0.4] = 0.24$$
$$m'_{DP}(\theta_2) = 0.06 + [m_1(\theta_2)m_2(\theta_1 \cap \theta_2) + m_2(\theta_2)m_1(\theta_1 \cap \theta_2)] = 0.06 + [0.1 \cdot 0.3 + 0.6 \cdot 0.4] = 0.33$$
$$m'_{DP}(\theta_1 \cup \theta_2) = m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2) = 0.5 \cdot 0.6 + 0.1 \cdot 0.1 = 0.31$$

The sum of the masses is $0.24 + 0.33 + 0.31 = 0.88 < 1$. The mass product $m_1(\theta_1 \cap \theta_2)m_2(\theta_1 \cap \theta_2) = 0.4 \cdot 0.3 = 0.12$ has been lost.

When applying the classic DSm rule in this case, one gets exactly the same results as Dubois & Prade, i.e. $m_{DSmc}(\emptyset) = 0$, $m_{DSmc}(\theta_1) = 0.05$, $m_{DSmc}(\theta_2) = 0.06$, $m_{DSmc}(\theta_1 \cap \theta_2) = 0.89$. Now if one takes into account the integrity constraint $\theta_1 \cap \theta_2 = \emptyset$ and uses the hybrid DSm rule of combination, one gets

$$m_{DSmh}(\emptyset) = 0 \quad \text{(by definition)}$$
$$m_{DSmh}(\theta_1) = 0.05 + [m_1(\theta_1)m_2(\theta_1 \cap \theta_2) + m_2(\theta_1)m_1(\theta_1 \cap \theta_2)] = 0.05 + [0.5 \cdot 0.3 + 0.1 \cdot 0.4] = 0.24$$
$$m_{DSmh}(\theta_2) = 0.06 + [m_1(\theta_2)m_2(\theta_1 \cap \theta_2) + m_2(\theta_2)m_1(\theta_1 \cap \theta_2)] = 0.06 + [0.1 \cdot 0.3 + 0.6 \cdot 0.4] = 0.33$$
$$m_{DSmh}(\theta_1 \cup \theta_2) = [m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2)] + \underbrace{[m_1(\theta_1 \cap \theta_2)m_2(\theta_1 \cap \theta_2)]}_{S_2 \text{ in the hybrid DSm rule}} = [0.31] + [0.12] = 0.43$$

Thus the sum of the masses obtained by the hybrid DSm rule of combination is $0.24 + 0.33 + 0.43 = 1$.

The disjunctive rule extended on the hyper-power set gives for this example

$$m_\cup(\theta_1) = [m_1(\theta_1)m_2(\theta_1)] + [m_1(\theta_1)m_2(\theta_1 \cap \theta_2) + m_2(\theta_1)m_1(\theta_1 \cap \theta_2)] = 0.05 + [0.15 + 0.04] = 0.24$$
$$m_\cup(\theta_2) = [m_1(\theta_2)m_2(\theta_2)] + [m_1(\theta_2)m_2(\theta_1 \cap \theta_2) + m_2(\theta_2)m_1(\theta_1 \cap \theta_2)] = 0.06 + [0.03 + 0.24] = 0.33$$
$$m_\cup(\theta_1 \cup \theta_2) = [m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2)] = 0.31$$
$$m_\cup(\theta_1 \cap \theta_2) = m_1(\theta_1 \cap \theta_2)m_2(\theta_1 \cap \theta_2) = 0.4 \cdot 0.3 = 0.12$$

If now one finds out that $\theta_1 \cap \theta_2 = \emptyset$, then the revised masses $m'_\cup(.)$ become $m'_\cup(\theta_1) = m_\cup(\theta_1)$, $m'_\cup(\theta_2) = m_\cup(\theta_2)$, $m'_\cup(\theta_1 \cup \theta_2) = m_\cup(\theta_1 \cup \theta_2)$ but $m'_\cup(\emptyset) \equiv m_\cup(\theta_1 \cap \theta_2) = 0.12 > 0$.
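The fifth example works on the hyper-power set, so plain subsets of the frame are no longer enough to hold the focal elements. The sketch below assumes an encoding in which each element of $D^\Theta$ is the set of Venn regions it covers, and an integrity constraint removes regions. It follows the three-way routing described in this section for two sources (intersection if non-empty, otherwise the $S_2$ or $S_3$ transfer) and should be read as an illustration, not as a transcription of the general rule. The names `LIVE`, `u` and `hybrid_dsm` are hypothetical.

```python
from itertools import combinations
from collections import defaultdict

N = 2
REGIONS = [frozenset(c) for r in range(1, N + 1)
           for c in combinations(range(1, N + 1), r)]
# Integrity constraint theta1 ∩ theta2 = ∅ removes the region {1, 2}.
LIVE = frozenset(r for r in REGIONS if r != frozenset({1, 2}))

def theta(i):
    """Encode theta_i as the set of Venn regions it covers (assumed encoding)."""
    return frozenset(r for r in REGIONS if i in r)

def u(x):
    """Union of the theta_i that compose x, read from its minimal regions."""
    minimal = [r for r in x if not any(s < r for s in x)]
    idx = set().union(*minimal)
    return frozenset(r for r in REGIONS if r & idx)

def hybrid_dsm(ma, mb):
    out = defaultdict(float)
    for x1, w1 in ma.items():
        for x2, w2 in mb.items():
            target = x1 & x2 & LIVE
            if not target:
                if not (x1 & LIVE) and not (x2 & LIVE):
                    # Both elements are empty: transfer to the ignorance (S2).
                    target = ((u(x1) | u(x2)) & LIVE) or LIVE
                else:
                    # Empty intersection: transfer to the union (S3).
                    target = (x1 | x2) & LIVE
            out[target] += w1 * w2
    return dict(out)

t1, t2 = theta(1), theta(2)
m1 = {t1: 0.5, t2: 0.1, t1 & t2: 0.4}
m2 = {t1: 0.1, t2: 0.6, t1 & t2: 0.3}

result = hybrid_dsm(m1, m2)
for element, mass in result.items():
    print(sorted(map(sorted, element)), round(mass, 2))
```

Result keys are expressed in the constrained model, so $\theta_1$ appears as the single region `{1}` and $\theta_1 \cup \theta_2$ as the regions `{1}` and `{2}`.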


1.5 Summary 


DSmT has to be viewed as a general flexible Bottom-Up approach for managing uncertainty and conflicts for a wide class of static or dynamic fusion problems where the information to combine is modelled as a finite set of belief functions provided by different independent sources of evidence. The development of DSmT emerged from the fact that the conflict between the sources of evidence arises not only from the unreliability of the sources themselves (which can be handled by classical discounting methods), but also from a different interpretation of the frame itself by the sources of evidence due to their limited knowledge and own (local) experience; not to mention the fact that elements of the frame cannot be truly refined at all in many problems involving only fuzzy and continuous concepts. Based on this matter of fact, DSmT proposes, according to the general block-scheme in Figure 1.2, a new appealing mathematical framework.

Here are the major steps for managing uncertain and conflicting information arising from independent sources of evidence in the DSmT framework, once expressed in terms of basic belief functions:

1. Bottom Level: The ground level of DSmT is to start from the free DSm model $\mathcal{M}^f(\Theta)$ associated with the frame $\Theta$ and the notion of hyper-power set (free Dedekind's lattice) $D^\Theta$. At this level, DSmT provides a general commutative and associative rule of combination of evidences (the conjunctive consensus) to work on $\mathcal{M}^f(\Theta)$.

2. Higher Level (only used when necessary): Depending on the absolute true intrinsic nature (assumed to be known by the fusion center) of the elements of the frame $\Theta$ of the fusion problem under consideration (which defines a set of integrity constraints on $\mathcal{M}^f(\Theta)$ leading to a particular hybrid DSm model $\mathcal{M}(\Theta)$), DSmT automatically adapts the combination process to work on any hybrid DSm model with the general hybrid DSm rule of combination explained in detail in chapter 4. Taking an integrity constraint into account consists just in forcing some elements of the Dedekind's lattice $D^\Theta$ to be empty, when they truly are, given the problem under consideration.

3. Decision-Making: Once the combination is obtained after step 1 (or step 2 when necessary), the decision-making step follows. Although no real general consensus has emerged in the literature over the last 30 years to give a well-accepted solution for the decision-making problem in the DST framework, we follow here Smets' idea and his justifications to work at the pignistic level [42] rather than at the credal level when a final decision has to be taken from any combined belief mass $m(.)$. A generalized pignistic transformation is then proposed in chapter 7 based on DSmT.
[Figure 1.2, block scheme from bottom to top: sources $s_1, \ldots, s_k$ with $m_1(.), \ldots, m_k(.) : D^\Theta \rightarrow [0,1]$; classic DSm rule based on the free model $\mathcal{M}^f(\Theta)$; introduction of integrity constraints into $D^\Theta$ (hybrid model $\mathcal{M}(\Theta)$); hybrid DSm rule for the hybrid model $\mathcal{M}(\Theta)$, $\forall A \in D^\Theta$, $m_{\mathcal{M}(\Theta)}(A) = \phi(A)[m_{\mathcal{M}^f(\Theta)}(A) + S_2(A) + S_3(A)]$; decision-making.]

Figure 1.2: Block Scheme of the principle for the DSm fusion


The introduction of a specific integrity constraint in step 2 is like pushing an elevator button for going a bit up in the complexity of the processing for managing uncertainty and conflict through the hybrid DSm rule of combination. If one needs to go to a higher level, then one can take into account several integrity constraints as well in the framework of DSmT. If we finally want to take into account all possible exclusivity constraints only (when we really know that all elements of the frame of the given problem are truly exclusive), then we go directly to the Top Level (i.e. Shafer's model which serves as foundation for Shafer's mathematical theory of evidence), but we still apply the hybrid DSm rule instead of Dempster's rule of combination. The DSmT approach for modelling the frame and combining information is more general than previous approaches$^{10}$ which have been mainly based on the Shafer model (which is a very specific and constrained DSm hybrid model) and works for static fusion problems.

The DSmT framework can easily handle not only exclusivity constraints, but also non-existential constraints or mixed constraints, which is very useful in some dynamic fusion problems as shown in chapter 4. Depending on the nature of the problem, we claim that it is unnecessary to try working at the Top Level (as DST does), when working directly at a lower level is sufficient to manage properly the information to combine using the hybrid DSm rule of combination.


10 Except the Transferable Belief Model of Smets [41] and the trade-off/averaging combination rules.

It is also important to reemphasize here that the general hybrid DSm rule of combination is definitely not equivalent to Dempster's rule of combination (and to all its alternatives involving conjunctive consensus based on the Top level and especially when working with dynamic problems) because DSmT allows one to work at any level of modelling for managing uncertainty and conflicts, depending on the intrinsic nature of the problem. The hybrid DSm rule and Dempster's rule do not provide the same results even when working on Shafer's model, as has been shown in the examples of the previous section and is explained in detail in forthcoming chapters] and 5]


DSmT differs from DST because it is based on the free Dedekind lattice. It works for any model (free DSm model and hybrid models, including Shafer's model as a special case) which fits adequately with the true nature of the fusion problem under consideration. This ability of DSmT allows one to deal formally with any fusion problem expressed in terms of belief functions which can mix discrete concepts with vague/continuous/relative concepts. DSmT deals with static and dynamic fusion problems in the same theoretical way, taking into account the integrity constraints of the model, which are considered either as static or as changing with time when necessary. The general hybrid DSm rule of combination of independent sources of evidence works for all possible static or dynamic models and does not require a normalization step. It differs from Dempster's rule of combination and from all its concurrent alternatives. The hybrid DSm rule of combination has moreover been extended to work for the combination of imprecise admissible belief assignments as well. The approach proposed by DSmT to attack the fusion problem throughout this book is therefore totally new in its foundations, its applicability and the solution provided.
Chapter 2


The generation of hyper-power sets


Jean Dezert
ONERA
29 Av. de la Division Leclerc
92320 Châtillon
France

Florentin Smarandache
Department of Mathematics
University of New Mexico
Gallup, NM 8730
U.S.A.


Abstract: The development of DSmT is based on the notion of Dedekind's lattice, also called hyper-power set in the DSmT framework, on which the general basic belief assignments to be combined are defined. In this chapter, we explain the structure of the hyper-power set, give some examples of hyper-power sets and show how they can be generated from isotone Boolean functions. We also show the benefit, in terms of complexity, of working with the hyper-power set rather than the power set of the refined frame of discernment.


2.1 Introduction 


One of the cornerstones of the DSmT is the notion of Dedekind's lattice, coined as hyper-power set by the authors in literature, which will be defined in next section. The starting point is to consider $\Theta = \{\theta_1, \ldots, \theta_n\}$ as a set of $n$ elements which cannot be precisely defined and separated so that no refinement of $\Theta$ in a new larger set $\Theta_{ref}$ of disjoint elementary hypotheses is possible. This corresponds to the free DSm model. This model is justified by the fact that in some fusion problems (mainly those manipulating vague or continuous concepts), the refinement of the frame is just impossible to obtain; nevertheless the fusion still applies when working on Dedekind's lattice and based on the DSm rule of combination. With the DSmT approach, the refinement of the frame is not a prerequisite for managing properly the combination of evidences and one can abandon Shafer's model in general. Even if Shafer's model is justified and adopted in some cases, the hybrid DSm rule of combination appears to be a new interesting and preferred alternative for managing highly conflicting sources of evidence. Our approach actually follows the footprints of our predecessors like Yager [23] and Dubois and Prade [7] to circumvent the problem of the applicability of Dempster's rule when facing highly conflicting sources of evidence, but with a new mathematical framework. The major reason for attacking the problem directly from the bottom level, i.e. the free DSm model, comes from the fact that in some real-world applications observations/concepts are not unambiguous. The ambiguity of observations is explained by Goodman, Mahler and Nguyen in [9] pp. 43-44. Moreover, the ambiguity can also come from the granularity of knowledge, known as Pawlak's indiscernibility or roughness [15].

This chapter is based on a paper presented during the International Conference on Information Fusion, Fusion 2003, Cairns, Australia, in July 2003 and is reproduced here with permission of the International Society of Information Fusion.


2.2 Definition of hyper-power set $D^\Theta$

The hyper-power set $D^\Theta$ is defined as the set of all composite propositions built from elements of $\Theta$ with $\cup$ and $\cap$ operators ($\Theta$ generates $D^\Theta$ under operators $\cup$ and $\cap$) such that

1. $\emptyset, \theta_1, \ldots, \theta_n \in D^\Theta$.

2. If $A, B \in D^\Theta$, then $A \cap B \in D^\Theta$ and $A \cup B \in D^\Theta$.

3. No other elements belong to $D^\Theta$, except those obtained by using rules 1 or 2.


The dual (obtained by switching $\cup$ and $\cap$ in expressions) of $D^\Theta$ is itself. There are elements in $D^\Theta$ which are self-dual (dual to themselves), for example $\alpha_8$ for the case when $n = 3$ in the example given in the next section. The cardinality of $D^\Theta$ is bounded above by $2^{2^n}$ when $\text{Card}(\Theta) = |\Theta| = n$. The generation of hyper-power set $D^\Theta$ is closely related with the famous Dedekind problem M B] on enumerating the set of monotone Boolean functions as it will be presented in the sequel with the generation of the elements of $D^\Theta$.
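Rules 1 to 3 describe a closure, which can be computed directly for small $n$. The sketch below is one possible way to do it and rests on an assumed encoding: each $\theta_i$ is represented by the set of Venn regions (non-empty subsets of the indices) that contain $i$, so that $\cup$ and $\cap$ are ordinary set operations. The function name `hyper_power_set` is hypothetical.

```python
from itertools import combinations

def hyper_power_set(n):
    """Close {∅, theta_1, ..., theta_n} under ∪ and ∩ (rules 1 and 2)."""
    regions = [frozenset(c) for r in range(1, n + 1)
               for c in combinations(range(1, n + 1), r)]
    thetas = [frozenset(r for r in regions if i in r)
              for i in range(1, n + 1)]

    elements = {frozenset(), *thetas}          # rule 1
    frontier = set(elements)
    while frontier:                            # rule 2, repeated until stable
        new = set()
        for a in frontier:
            for b in elements:
                for c in (a & b, a | b):
                    if c not in elements:
                        new.add(c)
        elements |= new
        frontier = new
    return elements                            # rule 3: nothing else is added

for n in range(5):
    print(n, len(hyper_power_set(n)))
```

The loop stops once a full pass produces no new element. The cost grows very quickly with $n$, so this brute-force closure is only practical for very small frames.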


2.3 Example of the first hyper-power sets 


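• In the degenerate case ($n = 0$) where $\Theta = \{\}$, one has $D^\Theta = \{\alpha_0 \triangleq \emptyset\}$ and $|D^\Theta| = 1$.

• When $\Theta = \{\theta_1\}$, one has $D^\Theta = \{\alpha_0 \triangleq \emptyset, \alpha_1 \triangleq \theta_1\}$ and $|D^\Theta| = 2$.

• When $\Theta = \{\theta_1, \theta_2\}$, one has $D^\Theta = \{\alpha_0, \alpha_1, \ldots, \alpha_4\}$ and $|D^\Theta| = 5$ with $\alpha_0 \triangleq \emptyset$, $\alpha_1 \triangleq \theta_1 \cap \theta_2$, $\alpha_2 \triangleq \theta_1$, $\alpha_3 \triangleq \theta_2$ and $\alpha_4 \triangleq \theta_1 \cup \theta_2$.

• When $\Theta = \{\theta_1, \theta_2, \theta_3\}$, the elements of $D^\Theta = \{\alpha_0, \alpha_1, \ldots, \alpha_{18}\}$ and $|D^\Theta| = 19$ (see [5] for details) are now given by (following the informational strength indexation explained in the next chapter):

Elements of $D^{\Theta = \{\theta_1, \theta_2, \theta_3\}}$

$\alpha_0 \triangleq \emptyset$ |
$\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3$ | $\alpha_{10} \triangleq \theta_2$
$\alpha_2 \triangleq \theta_1 \cap \theta_2$ | $\alpha_{11} \triangleq \theta_3$
$\alpha_3 \triangleq \theta_1 \cap \theta_3$ | $\alpha_{12} \triangleq (\theta_1 \cap \theta_2) \cup \theta_3$
$\alpha_4 \triangleq \theta_2 \cap \theta_3$ | $\alpha_{13} \triangleq (\theta_1 \cap \theta_3) \cup \theta_2$
$\alpha_5 \triangleq (\theta_1 \cup \theta_2) \cap \theta_3$ | $\alpha_{14} \triangleq (\theta_2 \cap \theta_3) \cup \theta_1$
$\alpha_6 \triangleq (\theta_1 \cup \theta_3) \cap \theta_2$ | $\alpha_{15} \triangleq \theta_1 \cup \theta_2$
$\alpha_7 \triangleq (\theta_2 \cup \theta_3) \cap \theta_1$ | $\alpha_{16} \triangleq \theta_1 \cup \theta_3$
$\alpha_8 \triangleq (\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3)$ | $\alpha_{17} \triangleq \theta_2 \cup \theta_3$
$\alpha_9 \triangleq \theta_1$ | $\alpha_{18} \triangleq \theta_1 \cup \theta_2 \cup \theta_3$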








Note that the classical complementary $\bar{A}$ of any proposition $A$ (except for $\emptyset$ and $\Theta$) is not involved within the free DSm model because of the refutation of the third excluded middle; it can however be introduced if necessary when dealing with hybrid models, as will be shown in chapter 4, if we introduce explicitly some exclusivity constraints into the free DSm model when one has no doubt on the exclusivity between given elements of $\Theta$ depending on the nature of the fusion problem. $|D^\Theta|$ for $n \geq 0$ follows the sequence of Dedekind's numbers$^1$, i.e. 1, 2, 5, 19, 167, 7580, 7828353, 2414682040997, 56130437228687557907787, ... [17]. Note also that this huge number of elements of the hyper-power set is comparatively far less than the total number of elements of the power set of the refined frame $\Theta_{ref}$ if one were to work on $2^{\Theta_{ref}}$ and if we admit the possibility that such a refinement exists, as will be seen in section 2.4.1.


1 Actually this sequence corresponds to the sequence of Dedekind minus one since we don't count the last degenerate isotone function $f_{2^{2^n}-1}(.)$ as element of $D^\Theta$ (see section 2.4).


2.4 The generation of $D^\Theta$


2.4.1 Memory size requirements and complexity


Before going further on the generation of $D^\Theta$, it is important to estimate the memory size for storing the elements of $D^\Theta$ for $|\Theta| = n$. Since each element of $D^\Theta$ can be stored as a $(2^n - 1)$-binary string, the memory size for $D^\Theta$ is given by the right column of the following table (we do not count the size for $\emptyset$ which is 0 and the minimum length is considered here as the byte (8 bits)):

$|\Theta| = n$ | size/elem. | # of elements | Size of $D^\Theta$
2 | 1 byte | 4 | 4 bytes
3 | 1 byte | 18 | 18 bytes
4 | 2 bytes | 166 | 0.32 Kb
5 | 4 bytes | 7579 | 30 Kb
6 | 8 bytes | 7828352 | 59 Mb
7 | 16 bytes | $\approx 2.4 \cdot 10^{12}$ | $3.6 \cdot 10^{4}$ Gb
8 | 32 bytes | $\approx 5.6 \cdot 10^{22}$ | $1.7 \cdot 10^{15}$ Gb





This table shows the extreme difficulties for our computers to store all the elements of $D^\Theta$ when $|\Theta| > 6$. This complexity remains however smaller than the number of all Boolean functions built from the ultimate refinement (if accessible) $2^{\Theta_{ref}}$ of the same initial frame $\Theta$ for applying DST. The comparison of $|D^\Theta|$ with respect to $|2^{\Theta_{ref}}|$ is given in the following table

$|\Theta| = n$ | $|D^\Theta|$ | $|2^{\Theta_{ref}}| = 2^{2^n - 1}$
2 | 5 | $2^3 = 8$
3 | 19 | $2^7 = 128$
4 | 167 | $2^{15} = 32768$
5 | 7580 | $2^{31} = 2147483648$
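The entries of both tables follow from two simple formulas and can be recomputed as a sanity check. In the sketch below the cardinalities $|D^\Theta|$ are taken as known inputs (Dedekind numbers minus one), and the helper names `bytes_per_element`, `storage_bytes` and `refined_power_set_size` are assumptions made for the illustration.

```python
# |D^Theta| for n = 2..8, taken as given (Dedekind numbers minus one).
CARD = {
    2: 5,
    3: 19,
    4: 167,
    5: 7580,
    6: 7828353,
    7: 2414682040997,
    8: 56130437228687557907787,
}

def bytes_per_element(n):
    """Each element is a (2^n - 1)-bit string, rounded up to whole bytes."""
    bits = 2**n - 1
    return (bits + 7) // 8

def storage_bytes(n):
    """Memory needed for all elements of D^Theta except ∅."""
    return (CARD[n] - 1) * bytes_per_element(n)

def refined_power_set_size(n):
    """|2^Theta_ref| = 2^(2^n - 1), the size of the refined power set."""
    return 2 ** (2**n - 1)

for n in sorted(CARD):
    print(n, bytes_per_element(n), CARD[n] - 1, storage_bytes(n))
for n in range(2, 6):
    print(n, CARD[n], refined_power_set_size(n))
```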





Fortunately, in most fusion applications only a small subset of elements of $D^\Theta$ have a non-null basic belief mass because all the commitments are usually impossible to assess precisely when the dimension of the problem increases. Thus, it is not necessary to generate and keep in memory all elements of $D^\Theta$ or $2^{\Theta_{ref}}$ but only those which have a positive belief mass. However there is a real technical challenge on how to manage efficiently all elements of the hyper-power set. This problem is obviously more difficult when working on $2^{\Theta_{ref}}$. Further investigations and research have to be carried out to develop implementable engineering solutions for managing high dimensional problems when the basic belief functions are not degenerated (i.e. all $m(A) > 0$, $A \in D^\Theta$ or $A \in 2^{\Theta_{ref}}$).


2.4.2 Monotone Boolean functions 


A simple Boolean function $f(.)$ maps $n$-binary inputs $(x_1, \ldots, x_n) \in \{0,1\}^n \triangleq \{0,1\} \times \ldots \times \{0,1\}$ to a single binary output $y = f(x_1, \ldots, x_n) \in \{0,1\}$. Since there are $2^n$ possible input states which can map to either 0 or 1 at the output $y$, the number of possible Boolean functions is $2^{2^n}$. Each of these functions can be realized by the logic operations $\wedge$ (and), $\vee$ (or) and $\neg$ (not) [BIBI]. As a simple example, let's consider only a 2-binary input variable $(x_1, x_2) \in \{0,1\} \times \{0,1\}$; then all the $2^{2^2} = 16$ possible Boolean functions $f_i(x_1, x_2)$ built from $(x_1, x_2)$ are summarized in the following tables:







with the notation Z £ =x, 11 V x2 £ (21 V £2) A (Z1 V Z2) (xor), 2, Vx2 = A(x, V 22) (nor), 1,423 £ 


(a1 A £2) V (Z1 A Z2) (xnor) and z1 A z2 £ =(21 A x2) (nand). 


We denote by Fa(^, V, =) = {fol(£1,..-, En). -, fo2r_1(01,..., Un)) the set of all possible Boolean 
functions built from n-binary inputs. Let x = (z1,..., £n) and x’ £ (x',,...,2'n) be two vectors in 
{0,1}”". Then x precedes x’ and we denote x < x’ if and only if x; < a’; for 1 < i < n (< is applied 


componentwise). If z; < a’; for 1 < i < n then x strictly precedes x’ which will be denoted as x < x’. 


A Boolean function $f$ is said to be a non-decreasing monotone (or isotone) Boolean function (or just monotone Boolean function for short) if and only if $\forall \mathbf{x}, \mathbf{x}' \in \{0,1\}^n$ such that $\mathbf{x} \preceq \mathbf{x}'$, one has $f(\mathbf{x}) \le f(\mathbf{x}')$ [19]. Since any isotone Boolean function involves only $\wedge$ and $\vee$ operators (no $\neg$ operations) [21], and since there exists a parallel between $(\vee, \wedge)$ operators in logics with $(+, \cdot)$ in algebra of numbers and $(\cup, \cap)$ in algebra of sets, the generation of all elements of $D^\Theta$ built from $\Theta$ with the $\cup$ and $\cap$ operators is equivalent to the problem of generating isotone Boolean functions over the vertices of the unit $n$-cube. We denote by $\mathcal{M}_n(\wedge, \vee)$ the set of all possible monotone Boolean functions built from $n$ binary inputs. $\mathcal{M}_n(\wedge, \vee)$ is a subset of $\mathcal{F}_n(\wedge, \vee, \neg)$. In the previous example, $f_1(x_1,x_2)$, $f_3(x_1,x_2)$, $f_5(x_1,x_2)$ and $f_7(x_1,x_2)$ are isotone Boolean functions, but the special functions $f_0(x_1,x_2)$ and $f_{2^{2^n}-1}(x_1,\ldots,x_n)$ must also be considered as monotone functions. All the other functions belonging to $\mathcal{F}_2(\wedge, \vee, \neg)$ do not belong to $\mathcal{M}_2(\wedge, \vee)$ because they require the $\neg$ operator in their expressions, and we can check easily that the monotonicity property $\mathbf{x} \preceq \mathbf{x}' \Rightarrow f(\mathbf{x}) \le f(\mathbf{x}')$ does not hold for these functions.
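The monotonicity test above can be checked by brute force on small cases. The following Python sketch enumerates all $2^{2^n}$ truth tables and keeps the isotone ones. The numbering convention is an assumption inferred from the text ($f_1 = x_1 \wedge x_2$, $f_7 = x_1 \vee x_2$): the least significant bit of the index is taken as the output for the input $(1,\ldots,1)$. The function names are illustrative only.

```python
from itertools import product

def truth_table(index, n):
    """Map each input tuple to the output bit of the Boolean function `index`.

    Assumed convention: inputs are listed from (0,...,0) to (1,...,1) and the
    least significant bit of `index` is the output for (1,...,1).
    """
    inputs = list(product((0, 1), repeat=n))
    size = len(inputs)
    return {x: (index >> (size - 1 - pos)) & 1 for pos, x in enumerate(inputs)}

def is_monotone(table):
    """True if x <= x' (componentwise) always implies f(x) <= f(x')."""
    return all(
        table[x] <= table[y]
        for x in table
        for y in table
        if all(a <= b for a, b in zip(x, y))
    )

def monotone_indices(n):
    """Indices of all isotone Boolean functions of n variables."""
    return [i for i in range(2 ** (2 ** n)) if is_monotone(truth_table(i, n))]

print(monotone_indices(2))
```

Under this convention the two-variable case returns the indices of $f_0$, $f_1$, $f_3$, $f_5$, $f_7$ and $f_{15}$, and the number of indices found for $n = 3$ agrees with Dedekind's value $d(3) = 20$.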
Dedekind's problem [4] is to determine the number $d(n)$ of distinct monotone Boolean functions of $n$ binary variables. Dedekind [4] computed $d(0) = 2$, $d(1) = 3$, $d(2) = 6$, $d(3) = 20$ and $d(4) = 168$. Church [1] computed $d(5) = 7581$ in 1940. Ward computed $d(6) = 7828354$ in 1946. Church [2] then computed $d(7) = 2414682040998$ in 1965. Between the sixties and the eighties, important advances were made in obtaining upper and lower bounds for $d(n)$ [0 [12] [14]. In 1991, Wiedemann [22] computed $d(8) = 56130437228687557907788$ (200 hours of computing time with a Cray-2 processor), which has recently been validated by Fidytek et al. in [8]. Until now, the computation of $d(n)$ for $n > 8$ is still a challenge for mathematicians, even if the following direct exact explicit formula for $d(n)$ has been obtained by Kisielewicz and Tombak (see [LI] [8] for proof):

$$d(n) = \sum_{k=1}^{2^{2^n}} \prod_{j=1}^{2^n-1} \prod_{i=0}^{j-1} \left(1 - b_i^k (1 - b_j^k) \prod_{m=0}^{l(i)} \left(1 - b_m^i (1 - b_m^j)\right)\right) \qquad (2.1)$$

where $l(0) = 0$ and $l(i) = \lfloor \log_2 i \rfloor$ for $i > 0$, $b_i^k \triangleq \lfloor k/2^i \rfloor - 2\lfloor k/2^{i+1} \rfloor$, and $\lfloor x \rfloor$ denotes the floor function (i.e. the nearest integer less than or equal to $x$). The difficulty arises from the huge number of terms involved in the formula, the memory size and the high-speed computation requirements. The latest advances and the state of the art in counting algorithms for Dedekind's numbers can be found in [13] [8] [19].
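To make the structure of the explicit formula concrete, here is a minimal Python sketch that evaluates it literally for very small $n$. It assumes the formula is read as a sum over $k$ of products of the terms $1 - b_i^k (1 - b_j^k) \prod_{m=0}^{l(i)} (1 - b_m^i (1 - b_m^j))$, with $b_i^k$ the $i$-th binary digit of $k$. The function names are hypothetical, and the direct evaluation is only practical for $n \le 3$ or so, which illustrates the difficulty mentioned above.

```python
def bit(value, position):
    """b_i^k = floor(k / 2^i) - 2 * floor(k / 2^(i+1)): binary digit `position` of `value`."""
    return (value >> position) & 1

def l(i):
    """l(0) = 0 and l(i) = floor(log2(i)) for i > 0."""
    return 0 if i == 0 else i.bit_length() - 1

def dedekind_by_formula(n):
    """Literal evaluation of the explicit sum-of-products formula for d(n)."""
    total = 0
    for k in range(1, 2 ** (2 ** n) + 1):
        term = 1
        for j in range(1, 2 ** n):
            for i in range(j):
                inner = 1
                for m in range(l(i) + 1):
                    inner *= 1 - bit(i, m) * (1 - bit(j, m))
                term *= 1 - bit(k, i) * (1 - bit(k, j)) * inner
            if term == 0:
                break          # the whole product is already zero
        total += term
    return total

print([dedekind_by_formula(n) for n in range(4)])
```

Each value of $k$ encodes a family of subsets, and a term survives only when that family is closed under taking supersets. The outer sum runs over $2^{2^n}$ values of $k$, which is what makes the formula impractical for larger $n$.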


2.4.3 Generation of MBF 


Before describing the general algorithm for generating the monotone Boolean functions (MBF), let us examine more closely the example of section 2.4.2. From the previous tables, one can easily find the set of (restricted) MBF $\mathcal{M}_2^\star(\wedge, \vee) = \{f_0(x_1,x_2) = \text{False}, f_1(x_1,x_2) = x_1 \wedge x_2, f_3(x_1,x_2) = x_1, f_5(x_1,x_2) = x_2, f_7(x_1,x_2) = x_1 \vee x_2\}$ which is equivalent, using algebra of sets, to the hyper-power set $D^X = \{\emptyset, x_1 \cap x_2, x_1, x_2, x_1 \cup x_2\}$ associated with the frame of discernment $X = \{x_1, x_2\}$. Since the tautology $f_{15}(x_1,x_2)$ is not involved within DSmT, we do not include it as a proper element of $D^X$ and we consider only $\mathcal{M}_2^\star(\wedge, \vee) \triangleq \mathcal{M}_2(\wedge, \vee) \setminus \{f_{15}\}$ rather than $\mathcal{M}_2(\wedge, \vee)$ itself.


Let's now introduce Smarandache's codification for the enumeration of distinct parts of a Venn diagram $X$ with $n$ partially overlapping elements $x_i$, $i = 1, 2, \ldots, n$. Such a diagram has $2^n - 1$ disjoint parts. One denotes with only one digit (or symbol) those parts which belong to only one of the elements $x_i$ (one denotes by $<i>$ the part which belongs to $x_i$ only, for $1 \le i \le n$), with only two digits (or symbols) those parts which belong to exactly two elements (one denotes by $<ij>$, with $i < j$, the part which belongs to $x_i$ and $x_j$ only, for $1 \le i < j \le n$), then with only three digits (or symbols) those parts which belong to exactly three elements (one denotes by $<ijk>$ concatenated numbers, with $i < j < k$, the part which belongs to $x_i$, $x_j$ and $x_k$ only, for $1 \le i < j < k \le n$), and so on up to $<12 \ldots n>$ which represents the last part that belongs to all elements $x_i$. For $1 \le n \le 9$, Smarandache's encoding works normally as in base 10. But, for $n \ge 10$, because there occur two (or more) digits/symbols in the notation of the elements starting from 10 on, one considers this codification in base $n + 1$, i.e. using one symbol to represent two (or more) digits, for example: A = 10, B = 11, C = 12, etc.





• For $n = 1$ one has only one part, coded $<1>$.

• For $n = 2$ one has three parts, coded $<1>$, $<2>$, $<12>$. Generally, $<ijk>$ does not represent $x_i \cap x_j \cap x_k$ but only a part of it; the only exception is for $<12 \ldots n>$.

• For $n = 3$ one has $2^3 - 1 = 7$ disjoint parts, coded $<1>$, $<2>$, $<3>$, $<12>$, $<13>$, $<23>$, $<123>$. $<23>$ means the part which belongs to $x_2$ and $x_3$ only, but $<23> \neq x_2 \cap x_3$ because $x_2 \cap x_3 = \{<23>, <123>\}$ in the Venn diagram of 3 elements $x_1$, $x_2$ and $x_3$ (see next chapter).

• The generalization for $n > 3$ is straightforward. Smarandache's codification can be organized in a numerical increasing order, in lexicographic order or in any other order.


A useful order for organizing Smarandache's codification for the generation of $D^X$ is the DSm-order $\mathbf{u}_n = [u_1, \ldots, u_{2^n-1}]'$ based on a recursive construction starting with $\mathbf{u}_1 \triangleq [<1>]$. Having constructed $\mathbf{u}_{n-1}$, we can construct $\mathbf{u}_n$ for $n > 1$ recursively as follows:

• include all elements of $\mathbf{u}_{n-1}$ into $\mathbf{u}_n$;

• afterwards, include element $<n>$ as well in $\mathbf{u}_n$;

• then at the end of each element of $\mathbf{u}_{n-1}$ concatenate the element $<n>$ and get a new set $\mathbf{u}'_{n-1}$ which then is also included in $\mathbf{u}_n$.

This is $\mathbf{u}_n$, which has $(2^{n-1} - 1) + 1 + (2^{n-1} - 1) = 2^n - 1$ components.
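The three steps of this recursive construction can be sketched in a few lines of Python. This is only an illustration under stated assumptions: part codes are stored as character strings (so `'12'` stands for $<12>$), and the symbol table used for $n \ge 10$ (A = 10, B = 11, ...) follows the convention suggested in the text, while its extent up to Z is an arbitrary choice.

```python
def dsm_order(n):
    """Return the DSm-order u_n as a list of part codes, e.g. '12' for <12>."""
    symbols = "123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ"   # one symbol per element x_i
    if not 1 <= n <= len(symbols):
        raise ValueError("n is out of range for this symbol table")
    u = [symbols[0]]                                  # u_1 = [<1>]
    for k in range(2, n + 1):
        new = symbols[k - 1]
        # all of u_{k-1}, then <k>, then every element of u_{k-1} with k appended
        u = u + [new] + [code + new for code in u]
    return u

print(dsm_order(3))
```

The list doubles in size (plus one) at every step, which reproduces the count $2^n - 1$ of components.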


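For $n = 3$, as an example, one gets $\mathbf{u}_3 \triangleq [<1>\ <2>\ <12>\ <3>\ <13>\ <23>\ <123>]'$. Because all elements in $\mathbf{u}_n$ are disjoint, we are able to write each element $d_i$ of $D^X$ in a unique way as a linear combination of $\mathbf{u}_n$ elements, i.e.

$$\mathbf{d}_n = [d_1, \ldots, d_{2^n-1}]' = \mathbf{D}_n \cdot \mathbf{u}_n \qquad (2.2)$$

Thus $\mathbf{u}_n$ constitutes a basis for generating the elements of $D^X$. Each row in the matrix $\mathbf{D}_n$ represents the coefficients of an element of $D^X$ with respect to the basis $\mathbf{u}_n$. The rows of $\mathbf{D}_n$ may also be regarded as binary numbers in an increasing order.

Example: For $n = 2$, one has:

$$\underbrace{\begin{bmatrix} d_1 \triangleq x_1 \cap x_2 \\ d_2 \triangleq x_2 \\ d_3 \triangleq x_1 \\ d_4 \triangleq x_1 \cup x_2 \end{bmatrix}}_{\mathbf{d}_2} = \underbrace{\begin{bmatrix} 0 & 0 & 1 \\ 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 1 \end{bmatrix}}_{\mathbf{D}_2} \cdot \underbrace{\begin{bmatrix} <1> \\ <2> \\ <12> \end{bmatrix}}_{\mathbf{u}_2}$$

where the "matrix product" is done after identifying $(+, \cdot)$ with $(\cup, \cap)$, $0 \cdot <x>$ with $\emptyset$ and $1 \cdot <x>$ with $<x>$.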
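The set-valued "matrix product" can be illustrated with a short Python sketch for the case $n = 2$. The representation is an assumption made for the example: each part of the Venn diagram is a string code, each element of $D^X$ is a `frozenset` of part codes, and `set_product` is a hypothetical helper name.

```python
def set_product(D, u):
    """'Multiply' a binary matrix D by the basis u, reading (+, .) as (union, intersection).

    A coefficient 1 keeps the corresponding part, a coefficient 0 contributes the empty set.
    """
    return [frozenset(code for coeff, code in zip(row, u) if coeff) for row in D]

u2 = ["1", "2", "12"]                       # DSm-order for n = 2
D2 = [[0, 0, 1],
      [0, 1, 1],
      [1, 0, 1],
      [1, 1, 1]]

x1 = frozenset({"1", "12"})                 # x1 is made of the parts <1> and <12>
x2 = frozenset({"2", "12"})                 # x2 is made of the parts <2> and <12>

d2 = set_product(D2, u2)
print(d2 == [x1 & x2, x2, x1, x1 | x2])
```

Each row of $\mathbf{D}_2$ is thus decoded into the union of the parts it selects, which gives $x_1 \cap x_2$, $x_2$, $x_1$ and $x_1 \cup x_2$ in that order.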


The generation of $D^X$ is then strictly equivalent to generating $\mathbf{u}_n$ and the matrix $\mathbf{D}_n$, which can be easily obtained by the following recursive procedure:

• start with $\mathbf{D}_0^c = [0\ 1]'$ corresponding to all Boolean functions with no input variable ($n = 0$).

• build the $\mathbf{D}_1^c$ matrix from each row $r_i$ of $\mathbf{D}_0^c$ by adjoining it to any other row $r_j$ of $\mathbf{D}_0^c$ such that $r_i \cup r_j = r_j$. This is equivalent here to adding either 0 or 1 in front (i.e. left side) of $r_1 \equiv 0$ but only 1 in front of $r_2 \equiv 1$. Since the tautology is not involved in the hyper-power set, one then has to remove the first column and the last line of

$$\mathbf{D}_1^c = \begin{bmatrix} 0 & 0 \\ 0 & 1 \\ 1 & 1 \end{bmatrix} \quad \text{to obtain finally} \quad \mathbf{D}_1 = \begin{bmatrix} 0 \\ 1 \end{bmatrix}$$

• build $\mathbf{D}_2^c$ from $\mathbf{D}_1^c$ by adjoining to each row $r_i$ of $\mathbf{D}_1^c$ any row $r_j$ of $\mathbf{D}_1^c$ such that $r_i \cup r_j = r_j$, and then remove the first column and the last line of $\mathbf{D}_2^c$ to get $\mathbf{D}_2$ as in (2.3).

• build $\mathbf{D}_3^c$ from $\mathbf{D}_2^c$ by adjoining to each row $r_i$ of $\mathbf{D}_2^c$ any row $r_j$ of $\mathbf{D}_2^c$ such that $r_i \cup r_j = r_j$, and then remove the first column and the last line of $\mathbf{D}_3^c$ to get $\mathbf{D}_3$ given by (where $\mathbf{D}'$ denotes here the transpose of the matrix $\mathbf{D}$)

$$\mathbf{D}'_3 = \left[\begin{array}{*{19}{c}}
0&0&0&0&0&0&0&0&0&0&0&0&0&0&1&1&1&1&1 \\
0&0&0&0&0&0&0&0&0&0&0&1&1&1&0&0&0&1&1 \\
0&0&0&0&0&0&1&1&1&1&1&1&1&1&1&1&1&1&1 \\
0&0&0&0&0&1&0&0&0&0&1&0&0&1&0&0&1&0&1 \\
0&0&0&1&1&1&0&0&1&1&1&0&1&1&1&1&1&1&1 \\
0&0&1&0&1&1&0&1&0&1&1&1&1&1&0&1&1&1&1 \\
0&1&1&1&1&1&1&1&1&1&1&1&1&1&1&1&1&1&1
\end{array}\right]$$

• Likewise, $\mathbf{D}_n^c$ is built from $\mathbf{D}_{n-1}^c$ by adjoining to each row $r_i$ of $\mathbf{D}_{n-1}^c$ any row $r_j$ of $\mathbf{D}_{n-1}^c$ such that $r_i \cup r_j = r_j$. Then $\mathbf{D}_n$ is obtained by removing the first column and the last line of $\mathbf{D}_n^c$.
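As a hedged illustration, the recursive procedure can be written compactly in Python. The sketch mirrors the logic of the appendix code but is not a transcription of it: rows are stored as tuples of 0/1, the row $r_i$ is placed to the left of $r_j$, and the function name `generate_D` is chosen for this example only.

```python
def generate_D(n):
    """Return the rows of D_n as tuples of 0/1, following the recursive procedure."""
    rows = [(0,), (1,)]                              # D_0^c = [0 1]'
    for _ in range(n):
        rows = [
            ri + rj                                  # adjoin r_j to r_i
            for a, ri in enumerate(rows)
            for rj in rows[a:]
            if all(x <= y for x, y in zip(ri, rj))   # r_i U r_j = r_j
        ]
    # remove the first column and the last line (the tautology)
    return [row[1:] for row in rows[:-1]]

for n in range(1, 5):
    print(n, len(generate_D(n)))
```

The number of rows obtained is $d(n) - 1$, i.e. 2, 5, 19 and 167 for $n = 1, \ldots, 4$, and each row has $2^n - 1$ columns, one per part of the DSm-order $\mathbf{u}_n$.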
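Example for $\Theta = \{\theta_1, \theta_2, \theta_3\}$: Note that the new indexation of elements of $D^\Theta$ now follows the MBF generation algorithm.

$$\underbrace{\begin{bmatrix}
\alpha_0 \triangleq \emptyset \\
\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 \\
\alpha_2 \triangleq \theta_2 \cap \theta_3 \\
\alpha_3 \triangleq \theta_1 \cap \theta_3 \\
\alpha_4 \triangleq (\theta_1 \cup \theta_2) \cap \theta_3 \\
\alpha_5 \triangleq \theta_3 \\
\alpha_6 \triangleq \theta_1 \cap \theta_2 \\
\alpha_7 \triangleq (\theta_1 \cup \theta_3) \cap \theta_2 \\
\alpha_8 \triangleq (\theta_2 \cup \theta_3) \cap \theta_1 \\
\alpha_9 \triangleq (\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3) \\
\alpha_{10} \triangleq (\theta_1 \cap \theta_2) \cup \theta_3 \\
\alpha_{11} \triangleq \theta_2 \\
\alpha_{12} \triangleq (\theta_1 \cap \theta_3) \cup \theta_2 \\
\alpha_{13} \triangleq \theta_2 \cup \theta_3 \\
\alpha_{14} \triangleq \theta_1 \\
\alpha_{15} \triangleq (\theta_2 \cap \theta_3) \cup \theta_1 \\
\alpha_{16} \triangleq \theta_1 \cup \theta_3 \\
\alpha_{17} \triangleq \theta_1 \cup \theta_2 \\
\alpha_{18} \triangleq \theta_1 \cup \theta_2 \cup \theta_3
\end{bmatrix}}_{\mathbf{d}_3}
=
\underbrace{\begin{bmatrix}
0&0&0&0&0&0&0 \\
0&0&0&0&0&0&1 \\
0&0&0&0&0&1&1 \\
0&0&0&0&1&0&1 \\
0&0&0&0&1&1&1 \\
0&0&0&1&1&1&1 \\
0&0&1&0&0&0&1 \\
0&0&1&0&0&1&1 \\
0&0&1&0&1&0&1 \\
0&0&1&0&1&1&1 \\
0&0&1&1&1&1&1 \\
0&1&1&0&0&1&1 \\
0&1&1&0&1&1&1 \\
0&1&1&1&1&1&1 \\
1&0&1&0&1&0&1 \\
1&0&1&0&1&1&1 \\
1&0&1&1&1&1&1 \\
1&1&1&0&1&1&1 \\
1&1&1&1&1&1&1
\end{bmatrix}}_{\mathbf{D}_3}
\cdot
\underbrace{\begin{bmatrix}
<1> \\ <2> \\ <12> \\ <3> \\ <13> \\ <23> \\ <123>
\end{bmatrix}}_{\mathbf{u}_3}$$

For convenience, we provide in the appendix the source code in the Matlab² language to generate $D^\Theta$. This code includes the identification of elements of $D^\Theta$ corresponding to each monotone Boolean function according to Smarandache's codification.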


2.5 Conclusion 


In this chapter, we have introduced the notion of Dedekind's lattice $D^\Theta$ (hyper-power set) on which basic belief functions are defined in the framework of DSmT, together with the acceptance of the free DSm model. The justification of the free DSm model as a starting point (ground level) for the development of our new theory of plausible and paradoxical reasoning for information fusion has also been given; it arises from the necessity to deal with possibly ambiguous concepts which can appear in real-world applications. The lower complexity of the hyper-power set with respect to the complexity of the classical refined power set $2^{\Theta_{ref}}$ has been clearly demonstrated here. We have proven the theoretical link between the generation of the hyper-power set and Dedekind's problem on counting isotone Boolean functions. A theoretical solution for generating lattices $D^\Theta$ has been presented and a MatLab source code has been provided for users' convenience.

² Matlab is a trademark of The MathWorks, Inc.
Appendix: MatLab code for generating hyper-power sets

%***************************************************
% Copyright (c) 2003 J. Dezert and F. Smarandache
%
% Purpose: Generation of D^Theta for the DSmT for
% Theta={theta_1,..,theta_n}. Due to the huge
% # of elements of D^Theta, only cases up to n<7
% are usually tractable on computers.
%***************************************************
n=input('Enter cardinality for Theta (0<n<6) ? ');
% Generation of the Smarandache codification
% Note: this should be implemented using
% character strings for n>9
u_n=[1];
for nn=2:n
    u_n=[u_n nn (u_n*10+nn*ones(1,size(u_n*10,2)))];
end
% Generation of D_n (isotone boolean functions)
D_n1=[0;1];
for nn=1:n, D_n=[];
    for i=1:size(D_n1,1), Li=D_n1(i,:);
        for j=i:size(D_n1,1)
            Lj=D_n1(j,:); Li_inter_Lj=and(Li,Lj);
            Li_union_Lj=or(Li,Lj);
            % keep the pair (Li,Lj) only when Li is included in Lj
            if (all(Li_inter_Lj==Li) && all(Li_union_Lj==Lj))
                D_n=[D_n; Li Lj];
            end
        end
    end
    D_n1=D_n;
end
% Remove the first column and the last line (tautology)
DD=D_n; DD(:,1)=[]; DD(size(DD,1),:)=[]; D_n=DD;
% Result display
disp(['|Theta|=n=',num2str(n)])
disp(['|D^Theta|=',num2str(size(D_n,1))])
disp('Elem. of D^Theta are obtained by D_n*u_n')
disp(['with u_n=[',num2str(u_n),']'' and'])
D_n=D_n

Matlab source code for generating $D^\Theta$
Abstract: In this chapter, we examine several approaches for ordering or partially ordering elements of hyper-power sets involved in the DSmT. We will show the benefit of some of these approaches in obtaining a nice and interesting structure for the matrix representation of belief functions.

This chapter is based on a paper [A] presented during the International Conference on Information Fusion, Fusion 2003, Cairns, Australia, in July 2003 and is reproduced here with permission of the International Society of Information Fusion.

3.1 Introduction to matrix calculus for belief functions

As rightly emphasized recently by Smets in [9], the mathematics of belief functions is often cumbersome because of the many summation symbols and all the subscripts involved in equations. This renders equations very difficult to read and understand at first sight and might discourage potential readers by their complexity. Actually, this is just an appearance, because most of the operations encountered in DST with belief functions and basic belief assignments $m(\cdot)$ are just simple linear operations, which can be easily represented using matrix notation and handled by elementary matrix calculus. We just focus here our presentation on the matrix representation of the relationship between a basic belief assignment $m(\cdot)$ and its associated belief function $\text{Bel}(\cdot)$. A nice and more complete presentation of matrix calculus for belief functions can be found in [6] [7] [9]. One important aspect for the simplification of matrix representation and calculus in DST concerns the choice of the order of the elements of the power set $2^\Theta$. The order of elements of $2^\Theta$ can actually be chosen arbitrarily, and it can be easily seen, by denoting $\mathbf{m}$ the bba vector of size $2^n \times 1$ and $\mathbf{Bel}$ its corresponding belief vector of the same size, that the set of equations $\text{Bel}(A) = \sum_{B \subseteq A} m(B)$ holding for all $A \subseteq \Theta$ is strictly equivalent to the following general matrix equation

$$\mathbf{Bel} = \mathbf{BM} \cdot \mathbf{m} \quad \Leftrightarrow \quad \mathbf{m} = \mathbf{BM}^{-1} \cdot \mathbf{Bel} \qquad (3.1)$$


where the internal structure of $\mathbf{BM}$ depends on the choice of the order for enumerating the elements of $2^\Theta$. But it turns out that the simplest ordering, based on the enumeration of integers from 0 to $2^n - 1$ expressed as $n$-binary strings with the lower bit on the right (LBR) (where $n = |\Theta|$) to characterize all the elements of the power set, is the most efficient solution and best encoding method for matrix calculus and for developing efficient algorithms in MatLab or similar programming languages [9]. By choosing the basic increasing binary enumeration (called bibe system), one obtains a very nice recursive algorithm on the dimension $n$ of $\Theta$ for computing the matrix $\mathbf{BM}$. The computation of $\mathbf{BM}$ for $|\Theta| = n$ is just obtained from the iterations up to $i + 1 = n$ of the recursive relation [9] starting with $\mathbf{BM}_0 \triangleq [1]$ and where $\mathbf{0}_{i+1}$ denotes the zero matrix of the same size as $\mathbf{BM}_i$ (i.e. $2^i \times 2^i$),

$$\mathbf{BM}_{i+1} = \begin{bmatrix} \mathbf{BM}_i & \mathbf{0}_{i+1} \\ \mathbf{BM}_i & \mathbf{BM}_i \end{bmatrix} \qquad (3.2)$$

$\mathbf{BM}$ is a binary unimodular matrix ($\det(\mathbf{BM}) = \pm 1$). $\mathbf{BM}$ is moreover lower triangular and symmetrical with respect to its antidiagonal.
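Example for $\Theta = \{\theta_1, \theta_2, \theta_3\}$

The bibe system gives us the following order for elements of $2^\Theta = \{\alpha_0, \ldots, \alpha_7\}$:

$$\begin{array}{llll}
\alpha_0 \equiv 000 \equiv \emptyset & \alpha_1 \equiv 001 \equiv \theta_1 & \alpha_2 \equiv 010 \equiv \theta_2 & \alpha_3 \equiv 011 \equiv \theta_1 \cup \theta_2 \\
\alpha_4 \equiv 100 \equiv \theta_3 & \alpha_5 \equiv 101 \equiv \theta_1 \cup \theta_3 & \alpha_6 \equiv 110 \equiv \theta_2 \cup \theta_3 & \alpha_7 \equiv 111 \equiv \theta_1 \cup \theta_2 \cup \theta_3 \equiv \Theta
\end{array}$$

Each element $\alpha_i$ of $2^\Theta$ is a 3-bit string. With this bibe system, one has $\mathbf{m} = [m(\alpha_0), \ldots, m(\alpha_7)]'$ and $\mathbf{Bel} = [\text{Bel}(\alpha_0), \ldots, \text{Bel}(\alpha_7)]'$. The expressions of the matrix $\mathbf{BM}_3$ and its inverse $\mathbf{BM}_3^{-1}$ are given by

$$\mathbf{BM}_3 = \begin{bmatrix}
1&0&0&0&0&0&0&0 \\
1&1&0&0&0&0&0&0 \\
1&0&1&0&0&0&0&0 \\
1&1&1&1&0&0&0&0 \\
1&0&0&0&1&0&0&0 \\
1&1&0&0&1&1&0&0 \\
1&0&1&0&1&0&1&0 \\
1&1&1&1&1&1&1&1
\end{bmatrix}$$

$$\mathbf{BM}_3^{-1} = \begin{bmatrix}
1&0&0&0&0&0&0&0 \\
-1&1&0&0&0&0&0&0 \\
-1&0&1&0&0&0&0&0 \\
1&-1&-1&1&0&0&0&0 \\
-1&0&0&0&1&0&0&0 \\
1&-1&0&0&-1&1&0&0 \\
1&0&-1&0&-1&0&1&0 \\
-1&1&1&-1&1&-1&-1&1
\end{bmatrix}$$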
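The recursion for $\mathbf{BM}$ in the bibe system is short enough to sketch in pure Python. The block form used for the inverse, $[\mathbf{BM}_i^{-1}\ \mathbf{0};\ -\mathbf{BM}_i^{-1}\ \mathbf{BM}_i^{-1}]$, is not stated in the text; it is the standard inverse of a block lower-triangular matrix and is used here as an assumption that the example checks numerically. The mass values are hypothetical and expressed in tenths to keep the arithmetic exact.

```python
def bm_power_set(n):
    """BM_n from BM_{i+1} = [[BM_i, 0], [BM_i, BM_i]], starting with BM_0 = [1]."""
    bm = [[1]]
    for _ in range(n):
        zeros = [0] * len(bm)
        bm = [row + zeros for row in bm] + [row + row for row in bm]
    return bm

def bm_power_set_inverse(n):
    """Inverse built blockwise as [[inv, 0], [-inv, inv]] (assumed block inverse)."""
    inv = [[1]]
    for _ in range(n):
        zeros = [0] * len(inv)
        inv = [row + zeros for row in inv] + [[-x for x in row] + row for row in inv]
    return inv

def mat_vec(matrix, vector):
    """Plain matrix-vector product on lists."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]

# Hypothetical masses (in tenths) listed in bibe order alpha_0, ..., alpha_7
m = [0, 5, 2, 0, 0, 0, 0, 3]
bel = mat_vec(bm_power_set(3), m)                 # Bel = BM . m
recovered = mat_vec(bm_power_set_inverse(3), bel) # m = BM^{-1} . Bel
print(bel, recovered == m)
```

In the bibe order, the entry of $\mathbf{BM}_n$ at row $a$ and column $b$ is 1 exactly when the bit pattern of $b$ is contained in that of $a$, which is the inclusion $B \subseteq A$ of the belief summation.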
3.2 Ordering elements of hyper-power set for matrix calculus 


As within the DST framework, the order of the elements of $D^\Theta$ can be arbitrarily chosen. We denote the Dedekind number of order $n$ as $d(n) \triangleq |D^\Theta|$ for $n = |\Theta|$. We denote also by $\mathbf{m}$ the gbba vector of size $d(n) \times 1$ and by $\mathbf{Bel}$ its corresponding belief vector of the same size. The set of equations $\text{Bel}(A) = \sum_{B \in D^\Theta, B \subseteq A} m(B)$ holding for all $A \in D^\Theta$ is then strictly equivalent to the following general matrix equation

$$\mathbf{Bel} = \mathbf{BM} \cdot \mathbf{m} \quad \Leftrightarrow \quad \mathbf{m} = \mathbf{BM}^{-1} \cdot \mathbf{Bel} \qquad (3.3)$$

Note the similarity between these relations and the previous ones (3.1). The only difference resides in the size of the vectors $\mathbf{Bel}$ and $\mathbf{m}$, the size of the matrix $\mathbf{BM}$, and their components. We explore in the following sections the possible choices for ordering (or partially ordering) the elements of the hyper-power set $D^\Theta$, to obtain an interesting structure for the $\mathbf{BM}$ matrix. Only three approaches are examined and briefly presented in the sequel. The first method is based on the direct enumeration of elements of $D^\Theta$ according to their recursive generation via the algorithm for generating all isotone Boolean functions presented in the previous chapter and in [3]. The second (partial) ordering method is based on the notion of DSm cardinality, which will be introduced in section 3.2.2. The last and most interesting solution proposed for partial ordering over $D^\Theta$ is obtained by introducing the notion of intrinsic informational strength $s(\cdot)$ associated with each element of the hyper-power set.


3.2.1 Order based on the enumeration of isotone Boolean functions 


We have presented in chapter 2 a recursive algorithm based on isotone Boolean functions for generating $D^\Theta$ with didactic examples. Here is briefly the principle of the method. Let's consider $\Theta = \{\theta_1, \ldots, \theta_n\}$ satisfying the DSm model and the DSm order $\mathbf{u}_n$ of Smarandache's codification of parts of the Venn diagram of $\Theta$ with $n$ partially overlapping elements $\theta_i$, $i = 1, \ldots, n$. All the elements $\alpha_i$ of $D^\Theta$ can then be obtained by the very simple linear equation $\mathbf{d}_n = \mathbf{D}_n \cdot \mathbf{u}_n$ where $\mathbf{d}_n \triangleq [\alpha_0 \equiv \emptyset, \alpha_1, \ldots, \alpha_{d(n)-1}]'$ is the vector of elements of $D^\Theta$, $\mathbf{u}_n$ is the proper codification vector and $\mathbf{D}_n$ a particular binary matrix. The final result $\mathbf{d}_n$ is obtained from the previous matrix product after identifying $(+, \cdot)$ with the $(\cup, \cap)$ operators, $0 \cdot x$ with $\emptyset$ and $1 \cdot x$ with $x$. $\mathbf{D}_n$ is actually a binary matrix corresponding to isotone (i.e. non-decreasing) Boolean functions obtained by applying recursively the steps (starting with $\mathbf{D}_0^c = [0\ 1]'$)

• $\mathbf{D}_n^c$ is built from $\mathbf{D}_{n-1}^c$ by adjoining to each row $r_i$ of $\mathbf{D}_{n-1}^c$ any row $r_j$ of $\mathbf{D}_{n-1}^c$ such that $r_i \cup r_j = r_j$. Then $\mathbf{D}_n$ is obtained by removing the first column and the last line of $\mathbf{D}_n^c$.

We denote by $r^{iso}(\alpha_i)$ the position of $\alpha_i$ in the column vector $\mathbf{d}_n$ obtained from the previous enumeration/generation system. Such a system provides a total order over $D^\Theta$ defined $\forall \alpha_i, \alpha_j \in D^\Theta$ as $\alpha_i \prec \alpha_j$ ($\alpha_i$ precedes $\alpha_j$) if and only if $r^{iso}(\alpha_i) < r^{iso}(\alpha_j)$. Based on this order, the $\mathbf{BM}$ matrix involved in (3.3) unfortunately presents no particularly interesting structure. We thus have to look for better solutions for ordering the elements of hyper-power sets.


3.2.2 Ordering based on the DSm cardinality 


A second possibility for ordering the elements of $D^\Theta$ is to (partially) order them by their increasing DSm cardinality.


Definition of the DSm cardinality 


The DSm cardinality of any element $A \in D^\Theta$, denoted $C_{\mathcal{M}}(A)$, corresponds to the number of parts of $A$ in the Venn diagram of the problem (model $\mathcal{M}$), taking into account the set of integrity constraints (if any), i.e. all the possible intersections due to the nature of the elements $\theta_i$. This intrinsic cardinality depends on the model $\mathcal{M}$. $\mathcal{M}$ is the model that contains $A$; it depends on the dimension of the Venn diagram (i.e. the number of sets $n = |\Theta|$ under consideration) and on the number of non-empty intersections in this diagram. $C_{\mathcal{M}}(A)$ must not be confused with the classical cardinality $|A|$ of a given set $A$ (i.e. the number of its distinct elements); that is why a new notation is necessary here.


Some properties of the DSm cardinality 


First note that one always has $1 \le C_{\mathcal{M}}(A) \le 2^n - 1$. In the (general) case of the free model $\mathcal{M}^f$ (i.e. the DSm model) where all conjunctions are non-empty, one has for intersections:

$$C_{\mathcal{M}^f}(\theta_1) = \ldots = C_{\mathcal{M}^f}(\theta_n) = 2^{n-1}$$

$$C_{\mathcal{M}^f}(\theta_i \cap \theta_j) = 2^{n-2} \quad \text{for } n \ge 2$$

$$C_{\mathcal{M}^f}(\theta_i \cap \theta_j \cap \theta_k) = 2^{n-3} \quad \text{for } n \ge 3$$

It can be proven by induction that for $1 \le m \le n$, one has $C_{\mathcal{M}^f}(\theta_{i_1} \cap \theta_{i_2} \cap \ldots \cap \theta_{i_m}) = 2^{n-m}$. For the cases $n = 1, 2, 3, 4$, this formula can be checked on the corresponding Venn diagrams. Let's consider this formula true for $n$ sets, and prove it for $n + 1$ sets (when all intersections/conjunctions are considered non-empty). From the Venn diagram of $n$ sets, we can get a Venn diagram with $n + 1$ sets if one draws a closed curve that cuts each of the $2^n - 1$ parts of the previous diagram (and, as a consequence, divides each part into two disjoint subparts). Therefore, the number of parts of each intersection doubles when passing from a diagram of dimension $n$ to a diagram of dimension $n + 1$. Q.e.d.

In the case of the free model $\mathcal{M}^f$, one has for unions:

$$C_{\mathcal{M}^f}(\theta_i \cup \theta_j) = 3(2^{n-2}) \quad \text{for } n \ge 2$$

$$C_{\mathcal{M}^f}(\theta_i \cup \theta_j \cup \theta_k) = 7(2^{n-3}) \quad \text{for } n \ge 3$$

It can also be proven by induction that for $1 \le m \le n$, one has $C_{\mathcal{M}^f}(\theta_{i_1} \cup \theta_{i_2} \cup \ldots \cup \theta_{i_m}) = (2^m - 1)(2^{n-m})$. The proof is similar to the previous one, keeping in mind that when passing from a Venn diagram of dimension $n$ to a dimension $n + 1$, each part that forms the union $\theta_i \cup \theta_j \cup \theta_k$ will be split into two disjoint parts, hence the number of parts doubles.
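The two closed forms can be checked by direct counting on small free models. The Python sketch below makes one modelling assumption: each part of the Venn diagram is represented by the non-empty set of indices it belongs to (Smarandache's codification), so that $\theta_i$ is the collection of parts whose code contains $i$. The helper names are illustrative.

```python
from itertools import combinations

def parts(n):
    """All 2^n - 1 parts of the free-model Venn diagram, each coded by a set of indices."""
    return [frozenset(c) for r in range(1, n + 1)
            for c in combinations(range(1, n + 1), r)]

def theta(i, n):
    """theta_i seen as the set of parts whose code contains the index i."""
    return {p for p in parts(n) if i in p}

def card_intersection(m, n):
    """DSm cardinality of theta_1 n ... n theta_m in the free model of dimension n."""
    return len(set.intersection(*[theta(i, n) for i in range(1, m + 1)]))

def card_union(m, n):
    """DSm cardinality of theta_1 U ... U theta_m in the free model of dimension n."""
    return len(set.union(*[theta(i, n) for i in range(1, m + 1)]))

for n in range(1, 5):
    for m in range(1, n + 1):
        print(n, m, card_intersection(m, n), card_union(m, n))
```

By symmetry of the free model, using $\theta_1, \ldots, \theta_m$ rather than arbitrary indices $i_1, \ldots, i_m$ loses no generality.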


For other elements $A$ in $D^\Theta$, formed by unions and intersections, the closed form for $C_{\mathcal{M}^f}(A)$ seems more complicated to obtain. But from the generation algorithm of $D^\Theta$, the DSm cardinal of a set $A$ from $D^\Theta$ is exactly equal to the sum of its coefficients in the $\mathbf{u}_n$ basis, i.e. the sum of its row elements in the $\mathbf{D}_n$ matrix, which is actually very easy to compute by programming. The DSm cardinality plays an important role in the definition of the Generalized Pignistic Transformation (GPT) for the construction of subjective/pignistic probabilities of elements of $D^\Theta$ for decision-making at the pignistic level, as explained in chapter 7 and in [5]. If one imposes a constraint that a set $B$ from $D^\Theta$ is empty, then one suppresses the columns corresponding to the parts which compose $B$ in the $\mathbf{D}_n$ matrix, the row of $B$, and the rows of all elements of $D^\Theta$ which are subsets of $B$, getting a new matrix $\mathbf{D}'_n$ which represents a new model $\mathcal{M}'$. In the $\mathbf{u}_n$ basis, one similarly suppresses the parts that form $B$, and now this basis has the dimension $2^n - 1 - C_{\mathcal{M}}(B)$.
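The remark that the DSm cardinal is the sum of the row elements of $\mathbf{D}_n$ translates directly into code. The following Python sketch regenerates $\mathbf{D}_n$ for the free model with the recursive procedure of the previous chapter and sums each row. `generate_D` and `dsm_cardinalities` are names chosen for this example, and the row order is the one produced by the generation algorithm, not the order of any table.

```python
def generate_D(n):
    """Rows of D_n (tuples of 0/1) for the free DSm model, built recursively."""
    rows = [(0,), (1,)]                              # D_0^c = [0 1]'
    for _ in range(n):
        rows = [
            ri + rj
            for a, ri in enumerate(rows)
            for rj in rows[a:]
            if all(x <= y for x, y in zip(ri, rj))   # r_i U r_j = r_j
        ]
    return [row[1:] for row in rows[:-1]]            # drop first column, last line

def dsm_cardinalities(n):
    """DSm cardinal of every element of D^Theta: the sum of its row in D_n."""
    return [sum(row) for row in generate_D(n)]

print(sorted(dsm_cardinalities(3)))
```

For $n = 3$ the sorted values run from 0 (empty set) to 7 (total ignorance), with several elements sharing the same cardinal, which is why this criterion yields only a partial order.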


Example of DSm cardinals on $\mathcal{M}^f$

Consider the 3D case $\Theta = \{\theta_1, \theta_2, \theta_3\}$ with the free model $\mathcal{M}^f$ corresponding to the following Venn diagram (where $<i>$ denotes the part which belongs to $\theta_i$ only, $<ij>$ denotes the part which belongs to $\theta_i$ and $\theta_j$ only, etc.; this is Smarandache's codification, see the previous chapter).

Figure 3.1: Venn Diagram for $\mathcal{M}^f$
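The corresponding partial ordering for elements of $D^\Theta$ is then summarized in the following table:

$$\begin{array}{l|c}
A \in D^\Theta & C_{\mathcal{M}^f}(A) \\
\hline
\alpha_0 \triangleq \emptyset & 0 \\
\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 & 1 \\
\alpha_2 \triangleq \theta_1 \cap \theta_2 & 2 \\
\alpha_3 \triangleq \theta_1 \cap \theta_3 & 2 \\
\alpha_4 \triangleq \theta_2 \cap \theta_3 & 2 \\
\alpha_5 \triangleq (\theta_1 \cup \theta_2) \cap \theta_3 & 3 \\
\alpha_6 \triangleq (\theta_1 \cup \theta_3) \cap \theta_2 & 3 \\
\alpha_7 \triangleq (\theta_2 \cup \theta_3) \cap \theta_1 & 3 \\
\alpha_8 \triangleq (\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3) & 4 \\
\alpha_9 \triangleq \theta_1 & 4 \\
\alpha_{10} \triangleq \theta_2 & 4 \\
\alpha_{11} \triangleq \theta_3 & 4 \\
\alpha_{12} \triangleq (\theta_1 \cap \theta_2) \cup \theta_3 & 5 \\
\alpha_{13} \triangleq (\theta_1 \cap \theta_3) \cup \theta_2 & 5 \\
\alpha_{14} \triangleq (\theta_2 \cap \theta_3) \cup \theta_1 & 5 \\
\alpha_{15} \triangleq \theta_1 \cup \theta_2 & 6 \\
\alpha_{16} \triangleq \theta_1 \cup \theta_3 & 6 \\
\alpha_{17} \triangleq \theta_2 \cup \theta_3 & 6 \\
\alpha_{18} \triangleq \theta_1 \cup \theta_2 \cup \theta_3 & 7
\end{array}$$

Table 3.1: $C_{\mathcal{M}^f}(A)$ for free DSm model $\mathcal{M}^f$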


Note that this partial ordering doesn't properly catch the intrinsic informational structure/strength of elements since, for example, $(\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3)$ and $\theta_1$ have the same DSm cardinal although they don't look similar: the part $<1>$ in $\theta_1$ belongs only to $\theta_1$, but none of the parts of $(\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3)$ belongs to only one of the $\theta_i$. A better ordering function is then necessary to catch the intrinsic informational structure of elements of $D^\Theta$. This is the purpose of the next section.


Example of DSm cardinals on a hybrid DSm model $\mathcal{M}$

Consider now the same 3D case with the hybrid DSm model $\mathcal{M} \neq \mathcal{M}^f$ in which we force all possible conjunctions to be empty, except $\theta_1 \cap \theta_2$, according to the following Venn diagram.

Figure 3.2: Venn Diagram for $\mathcal{M}$

The corresponding partial ordering for elements of $D^\Theta$ is then summarized in the following table:

Table 3.2: $C_{\mathcal{M}}(A)$ for hybrid DSm model $\mathcal{M}$


Another example based on Shafer’s model 


Consider now the same 3D case but including all exclusivity constraints on $\theta_i$, $i = 1, 2, 3$. This corresponds to the 3D Shafer's model $\mathcal{M}^0$ presented in the following Venn diagram.

[Venn diagram: three disjoint sets $\theta_1$, $\theta_2$, $\theta_3$]

Then, one gets the following list of elements (with their DSm cardinal) for the restricted $D^\Theta$, which coincides naturally with the classical power set $2^\Theta$:


56 CHAPTER 3. PARTIAL ORDERING ON HYPER-POWER SETS 


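$$\begin{array}{l|c}
A \in (D^\Theta \equiv 2^\Theta) & C_{\mathcal{M}^0}(A) \\
\hline
\alpha_0 \triangleq \emptyset & 0 \\
\alpha_1 \triangleq \theta_1 & 1 \\
\alpha_2 \triangleq \theta_2 & 1 \\
\alpha_3 \triangleq \theta_3 & 1 \\
\alpha_4 \triangleq \theta_1 \cup \theta_2 & 2 \\
\alpha_5 \triangleq \theta_1 \cup \theta_3 & 2 \\
\alpha_6 \triangleq \theta_2 \cup \theta_3 & 2 \\
\alpha_7 \triangleq \theta_1 \cup \theta_2 \cup \theta_3 & 3
\end{array}$$

Table 3.3: $C_{\mathcal{M}^0}(A)$ for Shafer's model $\mathcal{M}^0$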


The partial ordering of $D^\Theta$ based on DSm cardinality does not in general provide an efficient solution for getting an interesting structure for the $\mathbf{BM}$ matrix involved in (3.3), contrary to the structure obtained by Smets in the DST framework as in section 3.1. The partial ordering presented in the sequel will however allow us to get such a nice structure for the matrix calculus of belief functions.


3.2.3 Ordering based on the intrinsic informational content 


As already pointed out, the DSm cardinality is insufficient to catch the intrinsic informational content of each element $d_i$ of $D^\Theta$. A better approach is based on the following new function $s(\cdot)$, which describes the intrinsic information strength of any $d_i \in D^\Theta$. A previous, but cumbersome, definition of $s(\cdot)$ had been proposed in our previous works [1, 2], but it was difficult to handle and questionable with respect to the formal equivalent (dual) representation of elements belonging to $D^\Theta$.


Definition of the s(.) function 


We propose here a better choice for $s(\cdot)$, based on a very simple and natural geometrical interpretation of the relationships between the parts of the Venn diagram belonging to each $d_i \in D^\Theta$. All the values of the $s(\cdot)$ function (stored in a vector $\mathbf{s}$) over $D^\Theta$ are defined by the following equation:

$$\mathbf{s} = \mathbf{D}_n \cdot \mathbf{w}_n \qquad (3.4)$$

with $\mathbf{s} \triangleq [s(d_0) \ldots s(d_p)]'$ where $p$ is the cardinal of $D^\Theta$ for the model $\mathcal{M}$ under consideration. $p$ is equal to Dedekind's number $d(n) - 1$ if the free model $\mathcal{M}^f$ is chosen for $\Theta = \{\theta_1, \ldots, \theta_n\}$. $\mathbf{D}_n$ is the hyper-power set generating matrix. The components $w_i$ of the vector $\mathbf{w}_n$ are obtained from the components of the DSm encoding basis vector $\mathbf{u}_n$ as follows (see the previous chapter for details about $\mathbf{D}_n$ and $\mathbf{u}_n$):

$$w_i \triangleq 1/l(u_i) \qquad (3.5)$$

where $l(u_i)$ is the length of Smarandache's codification $u_i$ of the part of the Venn diagram of the model $\mathcal{M}$, i.e. the number of symbols involved in the codification.

For example, if $u_i = <123>$, then $l(u_i) = 3$ just because only three symbols 1, 2 and 3 enter in the codification $u_i$, thus $w_i = 1/3$.

From this new DSm ordering function $s(\cdot)$ we can partially order all the elements $d_i \in D^\Theta$ by the increasing values of $s(\cdot)$.
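A minimal Python sketch of the $s(\cdot)$ function for the free model follows. It assumes the matrix $\mathbf{D}_n$ and the DSm order are generated as in the previous chapter; exact fractions are used so that ties between elements are detected reliably. The helper names `generate_D`, `dsm_weights` and `strength` are hypothetical.

```python
from fractions import Fraction

def generate_D(n):
    """Rows of D_n (tuples of 0/1) for the free DSm model, built recursively."""
    rows = [(0,), (1,)]
    for _ in range(n):
        rows = [ri + rj for a, ri in enumerate(rows) for rj in rows[a:]
                if all(x <= y for x, y in zip(ri, rj))]
    return [row[1:] for row in rows[:-1]]

def dsm_weights(n):
    """w_i = 1 / l(u_i), where l(u_i) is the number of symbols in the part code u_i."""
    lengths = [1]                                     # u_1 = [<1>]
    for _ in range(2, n + 1):
        # same recursion as the DSm order: u_{k-1}, then <k>, then u_{k-1} with k appended
        lengths = lengths + [1] + [k + 1 for k in lengths]
    return [Fraction(1, k) for k in lengths]

def strength(n):
    """s = D_n . w_n, one value per element of D^Theta (generation order)."""
    w = dsm_weights(n)
    return [sum(c * wi for c, wi in zip(row, w)) for row in generate_D(n)]

print(sorted(strength(2)))
print(sorted(set(strength(3))))
```

For $n = 3$ the function takes nine distinct values on the nineteen elements, so it separates, for instance, the element $(\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3)$ from the singletons $\theta_i$, which the DSm cardinality alone could not do.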


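Example of ordering on $D^{\Theta = \{\theta_1, \theta_2\}}$ with $\mathcal{M}^f$

In this simple case, the DSm ordering of $D^\Theta$ is given by

$$\begin{array}{l|c}
\alpha_i \in D^\Theta & s(\alpha_i) \\
\hline
\alpha_0 \triangleq \emptyset & 0 \\
\alpha_1 \triangleq \theta_1 \cap \theta_2 & 1/2 \\
\alpha_2 \triangleq \theta_1 & 1 + 1/2 \\
\alpha_3 \triangleq \theta_2 & 1 + 1/2 \\
\alpha_4 \triangleq \theta_1 \cup \theta_2 & 1 + 1 + 1/2
\end{array}$$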


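Based on this ordering, it can be easily verified that the matrix calculus of the beliefs $\mathbf{Bel}$ from $\mathbf{m}$ by equation (3.3) is equivalent to

$$\underbrace{\begin{bmatrix} \text{Bel}(\emptyset) \\ \text{Bel}(\theta_1 \cap \theta_2) \\ \text{Bel}(\theta_1) \\ \text{Bel}(\theta_2) \\ \text{Bel}(\theta_1 \cup \theta_2) \end{bmatrix}}_{\mathbf{Bel}} = \underbrace{\begin{bmatrix} 1&0&0&0&0 \\ 1&1&0&0&0 \\ 1&1&1&0&0 \\ 1&1&0&1&0 \\ 1&1&1&1&1 \end{bmatrix}}_{\mathbf{BM}_2} \underbrace{\begin{bmatrix} m(\emptyset) \\ m(\theta_1 \cap \theta_2) \\ m(\theta_1) \\ m(\theta_2) \\ m(\theta_1 \cup \theta_2) \end{bmatrix}}_{\mathbf{m}}$$

where the $\mathbf{BM}_2$ matrix has an interesting structure (lower triangular and unimodular, $\det(\mathbf{BM}_2) = \det(\mathbf{BM}_2^{-1}) = 1$). Conversely, the calculus of the generalized basic belief assignment $\mathbf{m}$ from beliefs $\mathbf{Bel}$ will be obtained by the inversion of the previous linear system of equations

$$\underbrace{\begin{bmatrix} m(\emptyset) \\ m(\theta_1 \cap \theta_2) \\ m(\theta_1) \\ m(\theta_2) \\ m(\theta_1 \cup \theta_2) \end{bmatrix}}_{\mathbf{m}} = \underbrace{\begin{bmatrix} 1&0&0&0&0 \\ -1&1&0&0&0 \\ 0&-1&1&0&0 \\ 0&-1&0&1&0 \\ 0&1&-1&-1&1 \end{bmatrix}}_{\mathbf{BM}_2^{-1}} \underbrace{\begin{bmatrix} \text{Bel}(\emptyset) \\ \text{Bel}(\theta_1 \cap \theta_2) \\ \text{Bel}(\theta_1) \\ \text{Bel}(\theta_2) \\ \text{Bel}(\theta_1 \cup \theta_2) \end{bmatrix}}_{\mathbf{Bel}}$$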
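The two linear systems above can be reproduced with a small Python sketch. Elements of $D^\Theta$ are represented, as an assumption of this example, by the sets of Venn-diagram parts they contain, so that the entry of $\mathbf{BM}_2$ for the pair $(A, B)$ is simply the test $B \subseteq A$. The mass values are hypothetical, and the inversion uses forward substitution, which is valid because the matrix is lower triangular with a unit diagonal.

```python
from fractions import Fraction

# Elements in the order: empty set, theta1 n theta2, theta1, theta2, theta1 U theta2
ELEMENTS = [
    frozenset(),
    frozenset({"12"}),
    frozenset({"1", "12"}),
    frozenset({"2", "12"}),
    frozenset({"1", "2", "12"}),
]

def inclusion_matrix(elements):
    """BM[a][b] = 1 when element b is included in element a."""
    return [[1 if b <= a else 0 for b in elements] for a in elements]

def bel_from_m(bm, m):
    """Bel = BM . m"""
    return [sum(c * x for c, x in zip(row, m)) for row in bm]

def m_from_bel(bm, bel):
    """Solve BM . m = Bel by forward substitution (unit lower-triangular BM)."""
    m = []
    for i, row in enumerate(bm):
        m.append(bel[i] - sum(row[j] * m[j] for j in range(i)))
    return m

BM2 = inclusion_matrix(ELEMENTS)
masses = [Fraction(0), Fraction(1, 10), Fraction(4, 10), Fraction(3, 10), Fraction(2, 10)]
beliefs = bel_from_m(BM2, masses)
print(beliefs, m_from_bel(BM2, beliefs) == masses)
```

Forward substitution avoids forming $\mathbf{BM}_2^{-1}$ explicitly; it gives the same result as multiplying by the inverse matrix shown above.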


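Example of ordering on $D^{\Theta = \{\theta_1, \theta_2, \theta_3\}}$ with $\mathcal{M}^f$

In this more complicated case, the DSm ordering of $D^\Theta$ is now given by

$$\begin{array}{l|c}
\alpha_i \in D^\Theta & s(\alpha_i) \\
\hline
\emptyset & 0 \\
\theta_1 \cap \theta_2 \cap \theta_3 & 1/3 \\
\theta_1 \cap \theta_2 & 1/2 + 1/3 \\
\theta_1 \cap \theta_3 & 1/2 + 1/3 \\
\theta_2 \cap \theta_3 & 1/2 + 1/3 \\
(\theta_1 \cup \theta_2) \cap \theta_3 & 1/2 + 1/2 + 1/3 \\
(\theta_1 \cup \theta_3) \cap \theta_2 & 1/2 + 1/2 + 1/3 \\
(\theta_2 \cup \theta_3) \cap \theta_1 & 1/2 + 1/2 + 1/3 \\
(\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3) & 1/2 + 1/2 + 1/2 + 1/3 \\
\theta_1 & 1 + 1/2 + 1/2 + 1/3 \\
\theta_2 & 1 + 1/2 + 1/2 + 1/3 \\
\theta_3 & 1 + 1/2 + 1/2 + 1/3 \\
(\theta_1 \cap \theta_2) \cup \theta_3 & 1 + 1/2 + 1/2 + 1/2 + 1/3 \\
(\theta_1 \cap \theta_3) \cup \theta_2 & 1 + 1/2 + 1/2 + 1/2 + 1/3 \\
(\theta_2 \cap \theta_3) \cup \theta_1 & 1 + 1/2 + 1/2 + 1/2 + 1/3 \\
\theta_1 \cup \theta_2 & 1 + 1 + 1/2 + 1/2 + 1/2 + 1/3 \\
\theta_1 \cup \theta_3 & 1 + 1 + 1/2 + 1/2 + 1/2 + 1/3 \\
\theta_2 \cup \theta_3 & 1 + 1 + 1/2 + 1/2 + 1/2 + 1/3 \\
\theta_1 \cup \theta_2 \cup \theta_3 & 1 + 1 + 1 + 1/2 + 1/2 + 1/2 + 1/3
\end{array}$$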


The order for elements generating the same value of $s(\cdot)$ can be chosen arbitrarily and doesn't change the structure of the matrix $\mathbf{BM}_3$ given right after. That's why only a partial order is possible from $s(\cdot)$. It can be verified that $\mathbf{BM}_3$ also holds the same interesting matrix structure properties as before and that $\det(\mathbf{BM}_3) = \det(\mathbf{BM}_3^{-1}) = 1$. A similar structure can be shown for problems of higher dimensions ($n > 3$).

$$\mathbf{BM}_3 = \left[\begin{array}{*{19}{c}}
1&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0 \\
1&1&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0 \\
1&1&1&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0 \\
1&1&0&1&0&0&0&0&0&0&0&0&0&0&0&0&0&0&0 \\
1&1&0&0&1&0&0&0&0&0&0&0&0&0&0&0&0&0&0 \\
1&1&0&1&1&1&0&0&0&0&0&0&0&0&0&0&0&0&0 \\
1&1&1&0&1&0&1&0&0&0&0&0&0&0&0&0&0&0&0 \\
1&1&1&1&0&0&0&1&0&0&0&0&0&0&0&0&0&0&0 \\
1&1&1&1&1&1&1&1&1&0&0&0&0&0&0&0&0&0&0 \\
1&1&1&1&0&0&0&1&0&1&0&0&0&0&0&0&0&0&0 \\
1&1&1&0&1&0&1&0&0&0&1&0&0&0&0&0&0&0&0 \\
1&1&0&1&1&1&0&0&0&0&0&1&0&0&0&0&0&0&0 \\
1&1&1&1&1&1&1&1&1&0&0&1&1&0&0&0&0&0&0 \\
1&1&1&1&1&1&1&1&1&0&1&0&0&1&0&0&0&0&0 \\
1&1&1&1&1&1&1&1&1&1&0&0&0&0&1&0&0&0&0 \\
1&1&1&1&1&1&1&1&1&1&1&0&0&1&1&1&0&0&0 \\
1&1&1&1&1&1&1&1&1&1&0&1&1&0&1&0&1&0&0 \\
1&1&1&1&1&1&1&1&1&0&1&1&1&1&0&0&0&1&0 \\
1&1&1&1&1&1&1&1&1&1&1&1&1&1&1&1&1&1&1
\end{array}\right]$$

Although a nice structure for the matrix calculus of belief functions has been obtained in this work, and contrary to the recursive construction of $\mathbf{BM}_n$ in the DST framework, a recursive algorithm (on dimension $n$) for the construction of $\mathbf{BM}_n$ from $\mathbf{BM}_{n-1}$ has not yet been found (if such a recursive algorithm exists at all) and is still a difficult open problem for further research.
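The structural properties claimed for the $\mathbf{BM}$ matrix under the $s(\cdot)$ ordering can be checked numerically. The Python sketch below sorts the rows of $\mathbf{D}_n$ by increasing $s(\cdot)$ and builds the inclusion matrix. Ties are broken by the generation order, which is an arbitrary choice (the text notes that any choice is acceptable), so the matrix obtained may differ from the one printed above by a permutation inside tied groups. All function names are illustrative.

```python
from fractions import Fraction

def generate_D(n):
    """Rows of D_n (tuples of 0/1) for the free DSm model, built recursively."""
    rows = [(0,), (1,)]
    for _ in range(n):
        rows = [ri + rj for a, ri in enumerate(rows) for rj in rows[a:]
                if all(x <= y for x, y in zip(ri, rj))]
    return [row[1:] for row in rows[:-1]]

def dsm_weights(n):
    """w_i = 1 / l(u_i) along the DSm order u_n."""
    lengths = [1]
    for _ in range(2, n + 1):
        lengths = lengths + [1] + [k + 1 for k in lengths]
    return [Fraction(1, k) for k in lengths]

def bm_in_s_order(n):
    """Inclusion matrix of D^Theta with elements sorted by increasing s(.)."""
    w = dsm_weights(n)
    rows = sorted(generate_D(n), key=lambda r: sum(c * wi for c, wi in zip(r, w)))
    # entry (a, b) is 1 when element b is included in element a
    return [[int(all(x <= y for x, y in zip(rb, ra))) for rb in rows] for ra in rows]

bm3 = bm_in_s_order(3)
lower_triangular = all(bm3[i][j] == 0 for i in range(19) for j in range(i + 1, 19))
unit_diagonal = all(bm3[i][i] == 1 for i in range(19))
print(lower_triangular, unit_diagonal)
```

A lower triangular matrix with a unit diagonal has determinant 1, which is consistent with the unimodularity stated in the text. The underlying reason is that a strict inclusion $B \subset A$ always implies $s(B) < s(A)$, since all weights $w_i$ are positive.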


3.3 Conclusion 


In this chapter, we have analyzed several approaches to obtain an interesting matrix representation of the belief functions defined in the DSmT. For ordering the elements of the hyper-power set $D^\Theta$ we propose three such orderings: first, using the direct enumeration of isotone Boolean functions; second, based on the DSm cardinality; and third, maybe the most interesting, by introducing the intrinsic informational strength function $s(\cdot)$ constructed from the DSm encoding basis. The third order yields a nice internal structure of the transition matrix $\mathbf{BM}$, which makes it possible to compute directly and easily by programming the belief vector $\mathbf{Bel}$ from the basic belief mass vector $\mathbf{m}$ and conversely by inversion of the matrix $\mathbf{BM}$.
Combination of beliefs on hybrid DSm models

29 Av. de la Division Leclerc, 92320 Chatillon
University of New Mexico, Gallup, NM 8730
Abstract: This chapter presents a general method for combining uncertain and paradoxical (i.e. highly conflicting) sources of evidence for a wide class of fusion problems. From the foundations of the DSmT we show how the DSm rule of combination can be extended to take into account all possible integrity constraints (if any) of the problem under consideration due to the true nature of the elements/concepts involved in it. We show how Shafer's model can be considered as a specific hybrid DSm model and can be easily handled by the DSmT, and we present here a new efficient alternative to Dempster's rule of combination, following the steps of previous researchers in this quest. Several simple didactic examples are also provided to show the efficiency and the generality of the approach proposed in this work.
4.1 Introduction 


According to each model occurring in real-world fusion problems, we present a general hybrid DSm rule which combines two or more masses of independent sources of information and takes care of constraints, i.e. of sets which might become empty at time $t_l$ or new sets/elements that might arise in the frame at time $t_{l+1}$. The hybrid DSm rule is applied in real time when the hyper-power set $D^\Theta$ changes (i.e. the set of all propositions built from elements of frame $\Theta$ with $\cup$ and $\cap$ operators - see [3] for details), either increasing or decreasing its focal elements, or even when $\Theta$ decreases or increases, influencing $D^\Theta$ as well; hence the dynamicity of our DSmT.


This chapter first introduces the reader to the independence of sources of evidence, which needs to be studied more deeply in the future. It then defines the models and the hybrid DSm rule, which differs from other rules of combination such as Dempster's, Yager's, Smets' and Dubois-Prade's, gives seven numerical examples of applying the hybrid DSm rule in various models and several examples of the dynamicity of DSmT, and finally presents the Bayesian hybrid DSm models mixture.


4.2 On the independence of the sources of evidences 


The notion of independence of the sources of evidence plays a major role in the development of efficient information fusion algorithms but is very difficult to establish formally when manipulating uncertain and paradoxical (i.e. highly conflicting) sources of information. Some attempts to define the independence of uncertain sources of evidence have been proposed by P. Smets et al. in Dempster-Shafer Theory (DST) and the Transferable Belief Model in [A 3] [14] and by other authors in possibility theory [1, 2, 5, 8, 10]. In the following, we consider that $n$ sources of evidence are independent if the internal mechanism by which each source provides its own basic belief assignment does not depend on the mechanisms of the other sources (i.e. there is no internal relationship between all mechanisms) or if the sources do not share (even partially) the same knowledge/experience to establish their own basic belief assignment. This definition does not exclude the possibility for independent sources to provide the same (numerical) basic belief assignments. The fusion of dependent uncertain and paradoxical sources is much more complicated because one first has to identify precisely the piece of redundant information between sources in order to remove it before applying the fusion rules. The problem of combination of dependent sources is under investigation.


4.3 DSm rule of combination for free-DSm models 


4.3.1 Definition of the free-DSm model $\mathcal{M}^f(\Theta)$


Let us consider a finite frame $\Theta = \{\theta_1, \ldots, \theta_n\}$ of the fusion problem under consideration. We abandon Shafer's model by assuming here that the fuzzy/vague/relative nature of the elements $\theta_i$, $i = 1, \ldots, n$, of $\Theta$ can be non-exclusive. We also assume that no refinement of $\Theta$ into a new finer exclusive frame of discernment $\Theta_{ref}$ is possible. This is the free-DSm model $\mathcal{M}^f(\Theta)$, which can be viewed as the opposite (if we do not introduce non-existential constraints, see next section) of Shafer's model, denoted $\mathcal{M}^0(\Theta)$, where all $\theta_i$ are forced to be exclusive and therefore fully discernible.
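4.3.2 Example of a free-DSm model


Let us consider the frame of the problem $\Theta = \{\theta_1, \theta_2, \theta_3\}$. The free Dedekind lattice $D^\Theta = \{\alpha_0, \ldots, \alpha_{18}\}$ over $\Theta$ has the following 19 elements (see chapter 2):

Elements of $D^\Theta$ for $\mathcal{M}^f(\Theta)$

$$\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1\cap\theta_2\cap\theta_3 \neq \emptyset & \alpha_{10} \triangleq \theta_2 \neq \emptyset \\
\alpha_2 \triangleq \theta_1\cap\theta_2 \neq \emptyset & \alpha_{11} \triangleq \theta_3 \neq \emptyset \\
\alpha_3 \triangleq \theta_1\cap\theta_3 \neq \emptyset & \alpha_{12} \triangleq (\theta_1\cap\theta_2)\cup\theta_3 \neq \emptyset \\
\alpha_4 \triangleq \theta_2\cap\theta_3 \neq \emptyset & \alpha_{13} \triangleq (\theta_1\cap\theta_3)\cup\theta_2 \neq \emptyset \\
\alpha_5 \triangleq (\theta_1\cup\theta_2)\cap\theta_3 \neq \emptyset & \alpha_{14} \triangleq (\theta_2\cap\theta_3)\cup\theta_1 \neq \emptyset \\
\alpha_6 \triangleq (\theta_1\cup\theta_3)\cap\theta_2 \neq \emptyset & \alpha_{15} \triangleq \theta_1\cup\theta_2 \neq \emptyset \\
\alpha_7 \triangleq (\theta_2\cup\theta_3)\cap\theta_1 \neq \emptyset & \alpha_{16} \triangleq \theta_1\cup\theta_3 \neq \emptyset \\
\alpha_8 \triangleq (\theta_1\cap\theta_2)\cup(\theta_1\cap\theta_3)\cup(\theta_2\cap\theta_3) \neq \emptyset & \alpha_{17} \triangleq \theta_2\cup\theta_3 \neq \emptyset \\
\alpha_9 \triangleq \theta_1 \neq \emptyset & \alpha_{18} \triangleq \theta_1\cup\theta_2\cup\theta_3 \neq \emptyset
\end{array}$$
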
The free-DSm model $\mathcal{M}^f(\Theta)$ assumes that all elements $\alpha_i$, $i > 0$, are non-empty. This corresponds to the following Venn diagram where, in Smarandache's codification, "i" denotes the part of the diagram which belongs to $\theta_i$ only, "ij" denotes the part which belongs to $\theta_i$ and $\theta_j$ only, "ijk" denotes the part which belongs to $\theta_i$ and $\theta_j$ and $\theta_k$ only, etc. [8]. On such a Venn diagram representation of the model, we emphasize that all boundaries of intersections must be interpreted as only vague boundaries, because the nature of the elements $\theta_i$ can be, in general, only vague, relative and even imprecise (see chapter 6).


Figure 4.1: Venn Diagram for $\mathcal{M}^f(\Theta)$


For the chapter to be self-contained, we recall here the classical DSm rule of combination based on $\mathcal{M}^f(\Theta)$ over the free Dedekind lattice built from the elements of $\Theta$ with the $\cap$ and $\cup$ operators, i.e. $D^\Theta$.


4.3.3 Classical DSm rule for 2 sources for free-DSm models


For two independent uncertain and paradoxical (i.e. highly conflicting) sources of information (experts/bodies of evidence) providing generalized basic belief assignments $m_1(.)$ and $m_2(.)$ over $D^\Theta$ (or over any subset of $D^\Theta$), the classical DSm conjunctive rule of combination $m_{\mathcal{M}^f(\Theta)}(.) \triangleq [m_1 \oplus m_2](.)$ is given by

$$\forall A \neq \emptyset \in D^\Theta, \quad m_{\mathcal{M}^f(\Theta)}(A) \triangleq [m_1 \oplus m_2](A) = \sum_{\substack{X_1, X_2 \in D^\Theta \\ (X_1 \cap X_2) = A}} m_1(X_1)\, m_2(X_2) \tag{4.1}$$

$m_{\mathcal{M}^f(\Theta)}(\emptyset) = 0$ by definition, unless otherwise specified in special cases when some source assigns a non-zero value to it (as in Smets' TBM approach [9]). This DSm rule of combination is commutative and associative. This rule, dealing with both uncertain and paradoxical/conflicting information, requires no normalization process and can always be applied.
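
As an illustration of rule (4.1), the following Python sketch combines two sources on a free-DSm model with $n = 2$. It is only a sketch under stated assumptions: each element of $D^\Theta$ is represented by the set of parts of the Venn diagram (in Smarandache's codification) that it covers, so that $\cap$ and $\cup$ become ordinary set operations. The function names `theta` and `classic_dsm` and the numerical masses are hypothetical and do not come from the chapter.

```python
from collections import defaultdict
from itertools import combinations, product


def theta(i, n):
    """theta_i as the set of Venn parts (Smarandache codes) that contain index i."""
    parts = (frozenset(c) for r in range(1, n + 1)
             for c in combinations(range(1, n + 1), r))
    return frozenset(p for p in parts if i in p)


def classic_dsm(m1, m2):
    """Classic DSm rule (4.1): the product m1(X1)*m2(X2) is assigned to X1 n X2."""
    fused = defaultdict(float)
    for (x1, w1), (x2, w2) in product(m1.items(), m2.items()):
        fused[x1 & x2] += w1 * w2
    return dict(fused)


# Illustrative (assumed) masses on Theta = {theta_1, theta_2}.
t1, t2 = theta(1, 2), theta(2, 2)
m1 = {t1: 0.6, t2: 0.4}
m2 = {t1: 0.3, t2: 0.7}
m12 = classic_dsm(m1, m2)
```

With these assumed masses, the paradoxical element $\theta_1 \cap \theta_2$ receives $0.6 \times 0.7 + 0.4 \times 0.3 = 0.54$, $\theta_1$ receives 0.18 and $\theta_2$ receives 0.28; the fused masses sum to one without any normalization.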


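4.3.4 Classical DSm rule for $k \geq 2$ sources for free-DSm models


The above formula can be easily generalized for the free-DSm model $\mathcal{M}^f(\Theta)$ with $k \geq 2$ independent sources in the following way:

$$\forall A \neq \emptyset \in D^\Theta, \quad m_{\mathcal{M}^f(\Theta)}(A) \triangleq [m_1 \oplus \ldots \oplus m_k](A) = \sum_{\substack{X_1, \ldots, X_k \in D^\Theta \\ (X_1 \cap \ldots \cap X_k) = A}} \prod_{i=1}^{k} m_i(X_i) \tag{4.2}$$

$m_{\mathcal{M}^f(\Theta)}(\emptyset) = 0$ by definition, unless otherwise specified in special cases when some source assigns a non-zero value to it. This DSm rule of combination is still commutative and associative.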
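
Because the classic rule is commutative and associative, formula (4.2) can be evaluated by folding the two-source rule over the list of sources. The Python sketch below illustrates this under the same assumed representation of propositions as sets of Venn parts; the names `classic_dsm` and `classic_dsm_k` and all masses are hypothetical.

```python
from collections import defaultdict
from functools import reduce
from itertools import product


def classic_dsm(m1, m2):
    """Two-source classic DSm rule: mass products go to the intersection."""
    fused = defaultdict(float)
    for (x1, w1), (x2, w2) in product(m1.items(), m2.items()):
        fused[x1 & x2] += w1 * w2
    return dict(fused)


def classic_dsm_k(masses):
    """k-source classic DSm rule (4.2), obtained by pairwise folding."""
    return reduce(classic_dsm, masses)


# Venn parts of the free model for n = 2 are "1", "2" and "12" (assumed encoding).
T1 = frozenset({"1", "12"})        # theta_1
T2 = frozenset({"2", "12"})        # theta_2
m1 = {T1: 0.6, T2: 0.4}
m2 = {T1: 0.3, T2: 0.7}
m3 = {T1: 0.5, T1 | T2: 0.5}

fused = classic_dsm_k([m1, m2, m3])
reordered = classic_dsm_k([m3, m1, m2])   # same result up to rounding
```

In this assumed example the mass of $\theta_1 \cap \theta_2$ is 0.68, that of $\theta_1$ is 0.18 and that of $\theta_2$ is 0.14, whatever the order in which the three sources are combined.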


4.4 Presentation of hybrid DSm models 


4.4.1 Definition 


Let $\Theta$ be the general frame of the fusion problem under consideration with $n$ elements $\theta_1, \theta_2, \ldots, \theta_n$. A hybrid DSm model $\mathcal{M}(\Theta)$ is defined from the free-DSm model $\mathcal{M}^f(\Theta)$ by introducing some integrity constraints on some elements $A$ of $D^\Theta$, if one knows with certainty the exact nature of the model corresponding to the problem under consideration. An integrity constraint on $A$ consists in forcing $A$ to be empty (vacuous element), and we denote such a constraint by $A \overset{\mathcal{M}}{\equiv} \emptyset$, which means that $A$ has been forced to $\emptyset$ through the model $\mathcal{M}(\Theta)$. This can be justified by the knowledge of the true nature of each element $\theta_i$ of $\Theta$. Indeed, in some fusion problems, some elements $\theta_i$ and $\theta_j$ of $\Theta$ can be fully discernible because they are truly exclusive, while other elements cannot be refined into finer exclusive elements. Moreover, it is also possible that, for some reason and with some new knowledge on the problem, one or several elements $\theta_i$ have to be forced to the empty set (especially if dynamical fusion problems are considered, i.e. when $\Theta$ varies with space and time). For example, if we consider a list of three potential suspects in a police investigation, it can occur that, during the investigation, one of the suspects is withdrawn from the initial frame of the problem because his innocence is proven by an ascertainable alibi. The initial basic belief masses that the sources of information provided on the three suspects must then be modified to take into account this new knowledge on the model of the problem.


There exist several possible kinds of integrity constraints which can be introduced in any free-DSm model $\mathcal{M}^f(\Theta)$. The first kind concerns exclusivity constraints, which take into account that some conjunctions of elements $\theta_i, \ldots, \theta_k$ are truly impossible (i.e. $\theta_i \cap \ldots \cap \theta_k \overset{\mathcal{M}}{\equiv} \emptyset$). The second kind concerns non-existential constraints, which take into account that some disjunctions of elements $\theta_i, \ldots, \theta_k$ are also truly impossible (i.e. $\theta_i \cup \ldots \cup \theta_k \overset{\mathcal{M}}{\equiv} \emptyset$). We exclude from our presentation the completely degenerate case corresponding to the constraint $\theta_1 \cup \ldots \cup \theta_n \overset{\mathcal{M}}{\equiv} \emptyset$ (total ignorance), because there is no way and no interest to treat such a vacuous problem. In such a degenerate case, we can just set $m(\emptyset) = 1$, which is useless because the problem remains vacuous and $D^\Theta$ reduces to $\emptyset$. The last kind of possible integrity constraint is a mixture of the two previous ones, like for example $(\theta_i \cap \theta_j) \cup \theta_k$ or any other hybrid proposition/element of $D^\Theta$ involving both $\cap$ and $\cup$ operators such that at least one element $\theta_k$ is a subset of the constrained proposition. From any $\mathcal{M}^f(\Theta)$, we can thus build several hybrid DSm models depending on the number of integrity constraints one needs to fully characterize the nature of the problem. The introduction of a given integrity constraint $A \overset{\mathcal{M}}{\equiv} \emptyset \in D^\Theta$ necessarily implies the set of inner constraints $B \overset{\mathcal{M}}{\equiv} \emptyset$ for all $B \subset A$. Moreover, the introduction of two integrity constraints, say on $A$ and $B$ in $D^\Theta$, also necessarily implies the constraint on the emptiness of the disjunction $A \cup B$, which also belongs to $D^\Theta$ (because $D^\Theta$ is closed under the $\cap$ and $\cup$ operators). This implies the emptiness of all $C \in D^\Theta$ such that $C \subset (A \cup B)$. The same remark extends to the introduction of $n$ integrity constraints as well. Shafer's model is the unique and most constrained hybrid DSm model including all possible exclusivity constraints without any non-existential constraint, since all $\theta_i \neq \emptyset \in \Theta$ are forced to be mutually exclusive. Shafer's model is denoted $\mathcal{M}^0(\Theta)$ in the sequel. We denote by $\boldsymbol{\emptyset}_{\mathcal{M}}$ the set of elements of $D^\Theta$ which have been forced to be empty in the hybrid DSm model $\mathcal{M}$.


4.4.2 Example 1 : hybrid DSm model with an exclusivity constraint 


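Let $\Theta = \{\theta_1, \theta_2, \theta_3\}$ be the general frame of the problem under consideration, and let us consider the hybrid DSm model $\mathcal{M}_1(\Theta)$ built by introducing the exclusivity constraint $\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_1}{\equiv} \emptyset$. This exclusivity constraint implies no other constraint, because $\alpha_1$ does not contain any element of $D^\Theta$ other than itself. Therefore, one now has the following set of elements for $D^\Theta$:

Elements of $D^\Theta$ for $\mathcal{M}_1(\Theta)$

$$\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1\cap\theta_2\cap\theta_3 \overset{\mathcal{M}_1}{\equiv} \emptyset & \alpha_{10} \triangleq \theta_2 \neq \emptyset \\
\alpha_2 \triangleq \theta_1\cap\theta_2 \neq \emptyset & \alpha_{11} \triangleq \theta_3 \neq \emptyset \\
\alpha_3 \triangleq \theta_1\cap\theta_3 \neq \emptyset & \alpha_{12} \triangleq (\theta_1\cap\theta_2)\cup\theta_3 \neq \emptyset \\
\alpha_4 \triangleq \theta_2\cap\theta_3 \neq \emptyset & \alpha_{13} \triangleq (\theta_1\cap\theta_3)\cup\theta_2 \neq \emptyset \\
\alpha_5 \triangleq (\theta_1\cup\theta_2)\cap\theta_3 \neq \emptyset & \alpha_{14} \triangleq (\theta_2\cap\theta_3)\cup\theta_1 \neq \emptyset \\
\alpha_6 \triangleq (\theta_1\cup\theta_3)\cap\theta_2 \neq \emptyset & \alpha_{15} \triangleq \theta_1\cup\theta_2 \neq \emptyset \\
\alpha_7 \triangleq (\theta_2\cup\theta_3)\cap\theta_1 \neq \emptyset & \alpha_{16} \triangleq \theta_1\cup\theta_3 \neq \emptyset \\
\alpha_8 \triangleq (\theta_1\cap\theta_2)\cup(\theta_1\cap\theta_3)\cup(\theta_2\cap\theta_3) \neq \emptyset & \alpha_{17} \triangleq \theta_2\cup\theta_3 \neq \emptyset \\
\alpha_9 \triangleq \theta_1 \neq \emptyset & \alpha_{18} \triangleq \theta_1\cup\theta_2\cup\theta_3 \neq \emptyset
\end{array}$$

Hence the initial basic belief mass over $D^\Theta$ has to be transferred over the new constrained hyper-power set $D^\Theta(\mathcal{M}_1(\Theta))$ with the 18 elements defined just above (including actually 17 non-empty elements). The mechanism for the transfer of basic belief masses from $D^\Theta$ onto $D^\Theta(\mathcal{M}_1(\Theta))$ is given by the hybrid DSm rule of combination presented in the sequel.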


4.4.3 Example 2 : hybrid DSm model with another exclusivity constraint 


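As the second example of a hybrid DSm model $\mathcal{M}_2(\Theta)$, let us consider $\Theta = \{\theta_1, \theta_2, \theta_3\}$ and the exclusivity constraint $\alpha_2 \triangleq \theta_1 \cap \theta_2 \overset{\mathcal{M}_2}{\equiv} \emptyset$. This constraint also implies $\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_2}{\equiv} \emptyset$ since $\alpha_1 \subset \alpha_2$. Therefore, one now has the following set of elements for $D^\Theta(\mathcal{M}_2(\Theta))$:

Elements of $D^\Theta$ for $\mathcal{M}_2(\Theta)$

$$\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1\cap\theta_2\cap\theta_3 \overset{\mathcal{M}_2}{\equiv} \emptyset & \alpha_{10} \triangleq \theta_2 \neq \emptyset \\
\alpha_2 \triangleq \theta_1\cap\theta_2 \overset{\mathcal{M}_2}{\equiv} \emptyset & \alpha_{11} \triangleq \theta_3 \neq \emptyset \\
\alpha_3 \triangleq \theta_1\cap\theta_3 \neq \emptyset & \alpha_{12} \triangleq (\theta_1\cap\theta_2)\cup\theta_3 \overset{\mathcal{M}_2}{\equiv} \alpha_{11} \neq \emptyset \\
\alpha_4 \triangleq \theta_2\cap\theta_3 \neq \emptyset & \alpha_{13} \triangleq (\theta_1\cap\theta_3)\cup\theta_2 \neq \emptyset \\
\alpha_5 \triangleq (\theta_1\cup\theta_2)\cap\theta_3 \neq \emptyset & \alpha_{14} \triangleq (\theta_2\cap\theta_3)\cup\theta_1 \neq \emptyset \\
\alpha_6 \triangleq (\theta_1\cup\theta_3)\cap\theta_2 \overset{\mathcal{M}_2}{\equiv} \alpha_4 \neq \emptyset & \alpha_{15} \triangleq \theta_1\cup\theta_2 \neq \emptyset \\
\alpha_7 \triangleq (\theta_2\cup\theta_3)\cap\theta_1 \overset{\mathcal{M}_2}{\equiv} \alpha_3 \neq \emptyset & \alpha_{16} \triangleq \theta_1\cup\theta_3 \neq \emptyset \\
\alpha_8 \triangleq (\theta_1\cap\theta_2)\cup(\theta_1\cap\theta_3)\cup(\theta_2\cap\theta_3) \overset{\mathcal{M}_2}{\equiv} \alpha_5 \neq \emptyset & \alpha_{17} \triangleq \theta_2\cup\theta_3 \neq \emptyset \\
\alpha_9 \triangleq \theta_1 \neq \emptyset & \alpha_{18} \triangleq \theta_1\cup\theta_2\cup\theta_3 \neq \emptyset
\end{array}$$

Note that in this case several non-empty elements of $D^\Theta(\mathcal{M}_2(\Theta))$ coincide because of the constraint ($\alpha_6 \overset{\mathcal{M}_2}{\equiv} \alpha_4$, $\alpha_7 \overset{\mathcal{M}_2}{\equiv} \alpha_3$, $\alpha_8 \overset{\mathcal{M}_2}{\equiv} \alpha_5$, $\alpha_{12} \overset{\mathcal{M}_2}{\equiv} \alpha_{11}$). $D^\Theta(\mathcal{M}_2(\Theta))$ now has only 13 different elements. Note that the introduction of both constraints $\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_2}{\equiv} \emptyset$ and $\alpha_2 \triangleq \theta_1 \cap \theta_2 \overset{\mathcal{M}_2}{\equiv} \emptyset$ does not change the construction of $D^\Theta(\mathcal{M}_2(\Theta))$ because $\alpha_1 \subset \alpha_2$.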
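4.4.4 Example 3: hybrid DSm model with another exclusivity constraint


As the third example of a hybrid DSm model $\mathcal{M}_3(\Theta)$, let us consider $\Theta = \{\theta_1, \theta_2, \theta_3\}$ and the exclusivity constraint $\alpha_6 \triangleq (\theta_1 \cup \theta_3) \cap \theta_2 \overset{\mathcal{M}_3}{\equiv} \emptyset$. This constraint now implies $\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_3}{\equiv} \emptyset$ since $\alpha_1 \subset \alpha_6$, but also $\alpha_2 \triangleq \theta_1 \cap \theta_2 \overset{\mathcal{M}_3}{\equiv} \emptyset$ because $\alpha_2 \subset \alpha_6$ and $\alpha_4 \triangleq \theta_2 \cap \theta_3 \overset{\mathcal{M}_3}{\equiv} \emptyset$ because $\alpha_4 \subset \alpha_6$. Therefore, one now has the following set of elements for $D^\Theta(\mathcal{M}_3(\Theta))$:

Elements of $D^\Theta$ for $\mathcal{M}_3(\Theta)$

$$\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1\cap\theta_2\cap\theta_3 \overset{\mathcal{M}_3}{\equiv} \emptyset & \alpha_{10} \triangleq \theta_2 \neq \emptyset \\
\alpha_2 \triangleq \theta_1\cap\theta_2 \overset{\mathcal{M}_3}{\equiv} \emptyset & \alpha_{11} \triangleq \theta_3 \neq \emptyset \\
\alpha_3 \triangleq \theta_1\cap\theta_3 \neq \emptyset & \alpha_{12} \triangleq (\theta_1\cap\theta_2)\cup\theta_3 \overset{\mathcal{M}_3}{\equiv} \alpha_{11} \neq \emptyset \\
\alpha_4 \triangleq \theta_2\cap\theta_3 \overset{\mathcal{M}_3}{\equiv} \emptyset & \alpha_{13} \triangleq (\theta_1\cap\theta_3)\cup\theta_2 \neq \emptyset \\
\alpha_5 \triangleq (\theta_1\cup\theta_2)\cap\theta_3 \overset{\mathcal{M}_3}{\equiv} \alpha_3 \neq \emptyset & \alpha_{14} \triangleq (\theta_2\cap\theta_3)\cup\theta_1 \overset{\mathcal{M}_3}{\equiv} \alpha_9 \neq \emptyset \\
\alpha_6 \triangleq (\theta_1\cup\theta_3)\cap\theta_2 \overset{\mathcal{M}_3}{\equiv} \emptyset & \alpha_{15} \triangleq \theta_1\cup\theta_2 \neq \emptyset \\
\alpha_7 \triangleq (\theta_2\cup\theta_3)\cap\theta_1 \overset{\mathcal{M}_3}{\equiv} \alpha_3 \neq \emptyset & \alpha_{16} \triangleq \theta_1\cup\theta_3 \neq \emptyset \\
\alpha_8 \triangleq (\theta_1\cap\theta_2)\cup(\theta_1\cap\theta_3)\cup(\theta_2\cap\theta_3) \overset{\mathcal{M}_3}{\equiv} \alpha_5 \neq \emptyset & \alpha_{17} \triangleq \theta_2\cup\theta_3 \neq \emptyset \\
\alpha_9 \triangleq \theta_1 \neq \emptyset & \alpha_{18} \triangleq \theta_1\cup\theta_2\cup\theta_3 \neq \emptyset
\end{array}$$

$D^\Theta(\mathcal{M}_3(\Theta))$ now has only 10 different elements.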


4.4.5 Example 4: Shafer’s model 


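As the fourth particular example of a hybrid DSm model $\mathcal{M}_4(\Theta)$, let us consider $\Theta = \{\theta_1, \theta_2, \theta_3\}$ and the exclusivity constraint $\alpha_8 \triangleq \{(\theta_1 \cap \theta_2) \cup \theta_3\} \cap (\theta_1 \cup \theta_2) \overset{\mathcal{M}_4}{\equiv} \emptyset$. Therefore, one now has the following set of elements for $D^\Theta(\mathcal{M}_4(\Theta))$:

Elements of $D^\Theta$ for $\mathcal{M}_4(\Theta)$ (Shafer's model)

$$\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1\cap\theta_2\cap\theta_3 \overset{\mathcal{M}_4}{\equiv} \emptyset & \alpha_{10} \triangleq \theta_2 \neq \emptyset \\
\alpha_2 \triangleq \theta_1\cap\theta_2 \overset{\mathcal{M}_4}{\equiv} \emptyset & \alpha_{11} \triangleq \theta_3 \neq \emptyset \\
\alpha_3 \triangleq \theta_1\cap\theta_3 \overset{\mathcal{M}_4}{\equiv} \emptyset & \alpha_{12} \triangleq (\theta_1\cap\theta_2)\cup\theta_3 \overset{\mathcal{M}_4}{\equiv} \alpha_{11} \neq \emptyset \\
\alpha_4 \triangleq \theta_2\cap\theta_3 \overset{\mathcal{M}_4}{\equiv} \emptyset & \alpha_{13} \triangleq (\theta_1\cap\theta_3)\cup\theta_2 \overset{\mathcal{M}_4}{\equiv} \alpha_{10} \neq \emptyset \\
\alpha_5 \triangleq (\theta_1\cup\theta_2)\cap\theta_3 \overset{\mathcal{M}_4}{\equiv} \emptyset & \alpha_{14} \triangleq (\theta_2\cap\theta_3)\cup\theta_1 \overset{\mathcal{M}_4}{\equiv} \alpha_9 \neq \emptyset \\
\alpha_6 \triangleq (\theta_1\cup\theta_3)\cap\theta_2 \overset{\mathcal{M}_4}{\equiv} \emptyset & \alpha_{15} \triangleq \theta_1\cup\theta_2 \neq \emptyset \\
\alpha_7 \triangleq (\theta_2\cup\theta_3)\cap\theta_1 \overset{\mathcal{M}_4}{\equiv} \emptyset & \alpha_{16} \triangleq \theta_1\cup\theta_3 \neq \emptyset \\
\alpha_8 \triangleq (\theta_1\cap\theta_2)\cup(\theta_1\cap\theta_3)\cup(\theta_2\cap\theta_3) \overset{\mathcal{M}_4}{\equiv} \emptyset & \alpha_{17} \triangleq \theta_2\cup\theta_3 \neq \emptyset \\
\alpha_9 \triangleq \theta_1 \neq \emptyset & \alpha_{18} \triangleq \theta_1\cup\theta_2\cup\theta_3 \neq \emptyset
\end{array}$$

This model actually corresponds to Shafer's model $\mathcal{M}^0(\Theta)$ because this constraint includes all possible exclusivity constraints between the elements $\theta_i$, $i = 1, 2, 3$, since $\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 \subset \alpha_8$, $\alpha_2 \triangleq \theta_1 \cap \theta_2 \subset \alpha_8$, $\alpha_3 \triangleq \theta_1 \cap \theta_3 \subset \alpha_8$ and $\alpha_4 \triangleq \theta_2 \cap \theta_3 \subset \alpha_8$. $D^\Theta(\mathcal{M}_4(\Theta))$ now has $2^{|\Theta|} = 8$ different elements and coincides obviously with the classical power set $2^\Theta$. This corresponds to Shafer's model and serves as the foundation for Dempster-Shafer Theory.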


4.4.6 Example 5: hybrid DSm model with a non-existential constraint 


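As the fifth example of a hybrid DSm model $\mathcal{M}_5(\Theta)$, let us consider $\Theta = \{\theta_1, \theta_2, \theta_3\}$ and the non-existential constraint $\alpha_9 \triangleq \theta_1 \overset{\mathcal{M}_5}{\equiv} \emptyset$. In other words, we remove $\theta_1$ from the initial frame $\Theta = \{\theta_1, \theta_2, \theta_3\}$. This non-existential constraint implies $\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_5}{\equiv} \emptyset$, $\alpha_2 \triangleq \theta_1 \cap \theta_2 \overset{\mathcal{M}_5}{\equiv} \emptyset$, $\alpha_3 \triangleq \theta_1 \cap \theta_3 \overset{\mathcal{M}_5}{\equiv} \emptyset$ and $\alpha_7 \triangleq (\theta_2 \cup \theta_3) \cap \theta_1 \overset{\mathcal{M}_5}{\equiv} \emptyset$. Therefore, one now has the following set of elements for $D^\Theta(\mathcal{M}_5(\Theta))$:

Elements of $D^\Theta$ for $\mathcal{M}_5(\Theta)$

$$\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1\cap\theta_2\cap\theta_3 \overset{\mathcal{M}_5}{\equiv} \emptyset & \alpha_{10} \triangleq \theta_2 \neq \emptyset \\
\alpha_2 \triangleq \theta_1\cap\theta_2 \overset{\mathcal{M}_5}{\equiv} \emptyset & \alpha_{11} \triangleq \theta_3 \neq \emptyset \\
\alpha_3 \triangleq \theta_1\cap\theta_3 \overset{\mathcal{M}_5}{\equiv} \emptyset & \alpha_{12} \triangleq (\theta_1\cap\theta_2)\cup\theta_3 \overset{\mathcal{M}_5}{\equiv} \alpha_{11} \neq \emptyset \\
\alpha_4 \triangleq \theta_2\cap\theta_3 \neq \emptyset & \alpha_{13} \triangleq (\theta_1\cap\theta_3)\cup\theta_2 \overset{\mathcal{M}_5}{\equiv} \alpha_{10} \neq \emptyset \\
\alpha_5 \triangleq (\theta_1\cup\theta_2)\cap\theta_3 \overset{\mathcal{M}_5}{\equiv} \alpha_4 \neq \emptyset & \alpha_{14} \triangleq (\theta_2\cap\theta_3)\cup\theta_1 \overset{\mathcal{M}_5}{\equiv} \alpha_4 \neq \emptyset \\
\alpha_6 \triangleq (\theta_1\cup\theta_3)\cap\theta_2 \overset{\mathcal{M}_5}{\equiv} \alpha_4 \neq \emptyset & \alpha_{15} \triangleq \theta_1\cup\theta_2 \overset{\mathcal{M}_5}{\equiv} \alpha_{10} \neq \emptyset \\
\alpha_7 \triangleq (\theta_2\cup\theta_3)\cap\theta_1 \overset{\mathcal{M}_5}{\equiv} \emptyset & \alpha_{16} \triangleq \theta_1\cup\theta_3 \overset{\mathcal{M}_5}{\equiv} \alpha_{11} \neq \emptyset \\
\alpha_8 \triangleq (\theta_1\cap\theta_2)\cup(\theta_1\cap\theta_3)\cup(\theta_2\cap\theta_3) \overset{\mathcal{M}_5}{\equiv} \alpha_4 \neq \emptyset & \alpha_{17} \triangleq \theta_2\cup\theta_3 \neq \emptyset \\
\alpha_9 \triangleq \theta_1 \overset{\mathcal{M}_5}{\equiv} \emptyset & \alpha_{18} \triangleq \theta_1\cup\theta_2\cup\theta_3 \overset{\mathcal{M}_5}{\equiv} \alpha_{17} \neq \emptyset
\end{array}$$

$D^\Theta(\mathcal{M}_5(\Theta))$ now has 5 different elements and coincides obviously with the hyper-power set $D^{\Theta \setminus \theta_1}$.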


4.4.7 Example 6 : hybrid DSm model with two non-existential constraints 


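As the sixth example of a hybrid DSm model $\mathcal{M}_6(\Theta)$, let us consider $\Theta = \{\theta_1, \theta_2, \theta_3\}$ and the two non-existential constraints $\alpha_9 \triangleq \theta_1 \overset{\mathcal{M}_6}{\equiv} \emptyset$ and $\alpha_{10} \triangleq \theta_2 \overset{\mathcal{M}_6}{\equiv} \emptyset$. Actually, these two constraints are equivalent to choosing the single constraint $\alpha_{15} \triangleq \theta_1 \cup \theta_2 \overset{\mathcal{M}_6}{\equiv} \emptyset$. In other words, we now remove both $\theta_1$ and $\theta_2$ from the initial frame $\Theta = \{\theta_1, \theta_2, \theta_3\}$. These non-existential constraints now imply that $\alpha_1, \alpha_2, \ldots, \alpha_8$, including $\alpha_8 \triangleq \{(\theta_1 \cap \theta_2) \cup \theta_3\} \cap (\theta_1 \cup \theta_2)$, as well as $\alpha_{13} \triangleq (\theta_1 \cap \theta_3) \cup \theta_2$ and $\alpha_{14} \triangleq (\theta_2 \cap \theta_3) \cup \theta_1$, are all forced to $\emptyset$ through $\mathcal{M}_6$. Therefore, one now has the following set of elements for $D^\Theta(\mathcal{M}_6(\Theta))$: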




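Elements of $D^\Theta$ for $\mathcal{M}_6(\Theta)$

$$\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1\cap\theta_2\cap\theta_3 \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{10} \triangleq \theta_2 \overset{\mathcal{M}_6}{\equiv} \emptyset \\
\alpha_2 \triangleq \theta_1\cap\theta_2 \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{11} \triangleq \theta_3 \neq \emptyset \\
\alpha_3 \triangleq \theta_1\cap\theta_3 \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{12} \triangleq (\theta_1\cap\theta_2)\cup\theta_3 \overset{\mathcal{M}_6}{\equiv} \alpha_{11} \neq \emptyset \\
\alpha_4 \triangleq \theta_2\cap\theta_3 \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{13} \triangleq (\theta_1\cap\theta_3)\cup\theta_2 \overset{\mathcal{M}_6}{\equiv} \emptyset \\
\alpha_5 \triangleq (\theta_1\cup\theta_2)\cap\theta_3 \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{14} \triangleq (\theta_2\cap\theta_3)\cup\theta_1 \overset{\mathcal{M}_6}{\equiv} \emptyset \\
\alpha_6 \triangleq (\theta_1\cup\theta_3)\cap\theta_2 \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{15} \triangleq \theta_1\cup\theta_2 \overset{\mathcal{M}_6}{\equiv} \emptyset \\
\alpha_7 \triangleq (\theta_2\cup\theta_3)\cap\theta_1 \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{16} \triangleq \theta_1\cup\theta_3 \overset{\mathcal{M}_6}{\equiv} \alpha_{11} \neq \emptyset \\
\alpha_8 \triangleq (\theta_1\cap\theta_2)\cup(\theta_1\cap\theta_3)\cup(\theta_2\cap\theta_3) \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{17} \triangleq \theta_2\cup\theta_3 \overset{\mathcal{M}_6}{\equiv} \alpha_{11} \neq \emptyset \\
\alpha_9 \triangleq \theta_1 \overset{\mathcal{M}_6}{\equiv} \emptyset & \alpha_{18} \triangleq \theta_1\cup\theta_2\cup\theta_3 \overset{\mathcal{M}_6}{\equiv} \alpha_{11} \neq \emptyset
\end{array}$$

$D^\Theta(\mathcal{M}_6(\Theta))$ now reduces to only two different elements, $\emptyset$ and $\theta_3$. $D^\Theta(\mathcal{M}_6(\Theta))$ coincides obviously with the hyper-power set $D^{\Theta \setminus \{\theta_1, \theta_2\}}$. Because there exists only one possible non-empty element in $D^\Theta(\mathcal{M}_6(\Theta))$, such a problem is called a trivial problem. If one now introduces all non-existential constraints in the free-DSm model, then the initial problem reduces to a vacuous problem, also called the impossible problem, corresponding to $m(\emptyset) = 1$ (such a "problem" is not related to reality). Such trivial or vacuous problems are not considered anymore in the sequel since they present no real interest for engineering information fusion problems.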


4.4.8 Example 7 : hybrid DSm model with a mixed constraint 


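As the seventh example of a hybrid DSm model $\mathcal{M}_7(\Theta)$, let us consider $\Theta = \{\theta_1, \theta_2, \theta_3\}$ and the mixed exclusivity and non-existential constraint $\alpha_{12} \triangleq (\theta_1 \cap \theta_2) \cup \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$. This mixed constraint implies $\alpha_1 \triangleq \theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\alpha_2 \triangleq \theta_1 \cap \theta_2 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\alpha_3 \triangleq \theta_1 \cap \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\alpha_4 \triangleq \theta_2 \cap \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\alpha_5 \triangleq (\theta_1 \cup \theta_2) \cap \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\alpha_6 \triangleq (\theta_1 \cup \theta_3) \cap \theta_2 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\alpha_7 \triangleq (\theta_2 \cup \theta_3) \cap \theta_1 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\alpha_8 \triangleq \{(\theta_1 \cap \theta_2) \cup \theta_3\} \cap (\theta_1 \cup \theta_2) \overset{\mathcal{M}_7}{\equiv} \emptyset$ and $\alpha_{11} \triangleq \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$. Therefore, one now has the following set of elements for $D^\Theta(\mathcal{M}_7(\Theta))$:

Elements of $D^\Theta$ for $\mathcal{M}_7(\Theta)$

$$\begin{array}{ll}
\alpha_0 \triangleq \emptyset & \\
\alpha_1 \triangleq \theta_1\cap\theta_2\cap\theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset & \alpha_{10} \triangleq \theta_2 \neq \emptyset \\
\alpha_2 \triangleq \theta_1\cap\theta_2 \overset{\mathcal{M}_7}{\equiv} \emptyset & \alpha_{11} \triangleq \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset \\
\alpha_3 \triangleq \theta_1\cap\theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset & \alpha_{12} \triangleq (\theta_1\cap\theta_2)\cup\theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset \\
\alpha_4 \triangleq \theta_2\cap\theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset & \alpha_{13} \triangleq (\theta_1\cap\theta_3)\cup\theta_2 \overset{\mathcal{M}_7}{\equiv} \alpha_{10} \neq \emptyset \\
\alpha_5 \triangleq (\theta_1\cup\theta_2)\cap\theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset & \alpha_{14} \triangleq (\theta_2\cap\theta_3)\cup\theta_1 \overset{\mathcal{M}_7}{\equiv} \alpha_9 \neq \emptyset \\
\alpha_6 \triangleq (\theta_1\cup\theta_3)\cap\theta_2 \overset{\mathcal{M}_7}{\equiv} \emptyset & \alpha_{15} \triangleq \theta_1\cup\theta_2 \neq \emptyset \\
\alpha_7 \triangleq (\theta_2\cup\theta_3)\cap\theta_1 \overset{\mathcal{M}_7}{\equiv} \emptyset & \alpha_{16} \triangleq \theta_1\cup\theta_3 \overset{\mathcal{M}_7}{\equiv} \alpha_9 \neq \emptyset \\
\alpha_8 \triangleq (\theta_1\cap\theta_2)\cup(\theta_1\cap\theta_3)\cup(\theta_2\cap\theta_3) \overset{\mathcal{M}_7}{\equiv} \emptyset & \alpha_{17} \triangleq \theta_2\cup\theta_3 \overset{\mathcal{M}_7}{\equiv} \alpha_{10} \neq \emptyset \\
\alpha_9 \triangleq \theta_1 \neq \emptyset & \alpha_{18} \triangleq \theta_1\cup\theta_2\cup\theta_3 \overset{\mathcal{M}_7}{\equiv} \alpha_{15} \neq \emptyset
\end{array}$$

$D^\Theta(\mathcal{M}_7(\Theta))$ now reduces to only four different elements $\emptyset$, $\theta_1$, $\theta_2$ and $\theta_1 \cup \theta_2$.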
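
The element counts quoted in Examples 1 to 7 can be checked with a short Python sketch. It assumes (this encoding is ours, not the chapter's) that each element of $D^\Theta$ is stored as the set of Venn parts it covers, that $D^\Theta$ is generated by closing $\{\theta_1, \theta_2, \theta_3\}$ under $\cup$ and $\cap$, and that an integrity constraint simply deletes the parts of the constrained element; two propositions then coincide in the model when they keep the same parts. All function names are hypothetical.

```python
from itertools import combinations


def theta(i, n):
    """theta_i as the set of Venn parts (Smarandache codes) containing index i."""
    parts = (frozenset(c) for r in range(1, n + 1)
             for c in combinations(range(1, n + 1), r))
    return frozenset(p for p in parts if i in p)


def hyper_power_set(n):
    """Close {theta_1..theta_n} under union and intersection, then add the empty set."""
    elems = {theta(i, n) for i in range(1, n + 1)}
    while True:
        new = {a | b for a in elems for b in elems} | {a & b for a in elems for b in elems}
        if new <= elems:
            return elems | {frozenset()}
        elems |= new


def count_distinct(elements, constraint):
    """Number of different elements once the parts of `constraint` are forced empty."""
    return len({a - constraint for a in elements})


t1, t2, t3 = (theta(i, 3) for i in (1, 2, 3))
D = hyper_power_set(3)
constraints = {
    "M1": t1 & t2 & t3,
    "M2": t1 & t2,
    "M3": (t1 | t3) & t2,
    "M4": (t1 & t2) | (t1 & t3) | (t2 & t3),
    "M5": t1,
    "M6": t1 | t2,
    "M7": (t1 & t2) | t3,
}
counts = {name: count_distinct(D, c) for name, c in constraints.items()}
```

Under these assumptions `D` contains the 19 elements of the free model, and `counts` gives the number of different elements of each constrained hyper-power set, to be compared with the values stated in the examples above.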


4.5 DSm rule of combination for hybrid DSm models 


In this section, we present a general DSm-hybrid rule of combination able to deal with any hybrid DSm model (including Shafer's model). We will show how this new general rule of combination works with all hybrid DSm models presented in the previous section, and we list interesting properties of this new useful and powerful rule of combination.


4.5.1 Notations 


Let $\Theta = \{\theta_1, \ldots, \theta_n\}$ be a frame of partial discernment (i.e. a frame $\Theta$ for which at least one conjunctive element of $D^\Theta \setminus \{\emptyset\}$ is known to be truly empty) of the constrained fusion problem, and $D^\Theta$ the free distributive lattice (hyper-power set) generated by $\Theta$ and the empty set $\emptyset$ under the $\cap$ and $\cup$ operators. We need to distinguish between the empty set $\emptyset$, which belongs to $D^\Theta$ and by which we understand a set that is empty all the time, independently of time, space and model (we call it absolute emptiness or absolutely empty), and all other sets of $D^\Theta$, for example $\theta_i \cap \theta_j$ or $\theta_i \cup \theta_j$ or only $\theta_i$ itself, $1 \leq i \leq n$, etc., which could be or become empty at a certain time (if we consider fusion dynamicity) or in a particular model $\mathcal{M}$, but might not be empty in another model and/or at another time (we call such an element relative emptiness or relatively empty). We denote by $\boldsymbol{\emptyset}_{\mathcal{M}}$ the set of such relatively empty elements of $D^\Theta$ (i.e. those which become empty in a particular model $\mathcal{M}$ or at a specific time). $\boldsymbol{\emptyset}_{\mathcal{M}}$ is the set of integrity constraints, which depends on the DSm model $\mathcal{M}$ under consideration, and the model $\mathcal{M}$ depends on the structure of its corresponding fuzzy Venn diagram (number of elements in $\Theta$, number of non-empty intersections, and time in the case of dynamic fusion). By our convention, $\emptyset \notin \boldsymbol{\emptyset}_{\mathcal{M}}$. Let us denote by $\boldsymbol{\emptyset} \triangleq \{\emptyset, \boldsymbol{\emptyset}_{\mathcal{M}}\}$ the set of all relatively and absolutely empty elements.
For any $A \in D^\Theta$, let $\phi(A)$ be the characteristic non-emptiness function of the set $A$, i.e. $\phi(A) = 1$ if $A \notin \boldsymbol{\emptyset}$ and $\phi(A) = 0$ otherwise. This function assigns the value zero to all relatively or absolutely empty elements of $D^\Theta$ through the choice of the hybrid DSm model $\mathcal{M}$. Let us define the total ignorance on $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$ as $I_t \triangleq \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$ and the set of relative ignorances as $I_r \triangleq \{\theta_{i_1} \cup \ldots \cup \theta_{i_k}$, where $i_1, \ldots, i_k \in \{1, 2, \ldots, n\}$ and $2 \leq k \leq n - 1\}$, then the set of all kinds of ignorances as $I = I_t \cup I_r$. For any element $A$ in $D^\Theta$, one considers $u(A)$ as the union of all singletons $\theta_i$ that compose $A$. For example, if $A$ is a singleton then $u(A) = A$; if $A = \theta_1 \cap \theta_2$ or $A = \theta_1 \cup \theta_2$ then $u(A) = \theta_1 \cup \theta_2$; if $A = (\theta_1 \cap \theta_2) \cup \theta_3$ then $u(A) = \theta_1 \cup \theta_2 \cup \theta_3$; by convention $u(\emptyset) \triangleq \emptyset$.

The second summation of the hybrid DSm rule (see eqs. (4.3) and (4.5), denoted $S_2$ in the sequel) transfers the mass of $\emptyset$ [if any; sometimes, in rare cases, $m(\emptyset) > 0$ (for example in Smets' work), and we want to catch this particular case as well] to the total ignorance $I_t = \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$. The other part of the mass of relatively empty elements, $\theta_i$ and $\theta_j$ together for example, $i \neq j$, goes to the partial ignorance/uncertainty $m(\theta_i \cup \theta_j)$. $S_2$ multiplies, naturally following the DSm classic network architecture, only the elements of the columns of absolutely and relatively empty sets, and then $S_2$ transfers the mass $m_1(X_1) m_2(X_2) \ldots m_k(X_k)$ either to the element $A \in D^\Theta$ in the case when $A = u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)$ is not empty, or to the total ignorance if $u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)$ is empty. We include all degenerate problems/models in this new DSmT hybrid framework, except the degenerate/vacuous DSm-hybrid model $\mathcal{M}_\emptyset$ defined by the constraint $I_t = \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n \overset{\mathcal{M}_\emptyset}{\equiv} \emptyset$, which is meaningless and useless.


4.5.2 Programming of the u(X) function 


We address here the programming of the computation of $u(X)$ from the binary representation of any proposition $X \in D^\Theta$ expressed in the Dezert-Smarandache order (see chapters 2 and 3). Let us consider the Smarandache codification of the elements $\theta_1, \ldots, \theta_n$. One defines the anti-absorbing relationship as follows: element $i$ anti-absorbs element $ij$ (with $i < j$), which we denote $i << ij$, and also $j << ij$; similarly $ij << ijk$ (with $i < j < k$), also $jk << ijk$ and $ik << ijk$. This relationship is transitive, therefore $i << ij$ and $ij << ijk$ imply $i << ijk$; one can also write $i << ij << ijk$ as a chain; similarly one gets $j << ijk$ and $k << ijk$. The anti-absorbing relationship can be generalized to parts with any number of digits, i.e. when one uses the Smarandache codification for the corresponding Venn diagram on $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$, with $n > 1$. Between elements $ij$ and $ik$, or between $ij$ and $jk$, there is no anti-absorbing relationship; therefore the anti-absorbing relationship defines a partial order on the parts of the Venn diagram for the free-DSm model. If a proposition $X$ is formed by one part only, say $i_1 i_2 \ldots i_r$ in the Smarandache codification, then $u(X) = \theta_{i_1} \cup \theta_{i_2} \cup \ldots \cup \theta_{i_r}$. If $X$ is formed by two or more parts, the first step is to eliminate all anti-absorbed parts, i.e. if $A << B$ then $u(\{A, B\}) = u(A)$; generally speaking, a part $B$ is anti-absorbed by a part $A$ if all digits of $A$ belong to $B$; for an anti-absorbing chain $A_1 << A_2 << \ldots << A_s$ one keeps $A_1$ only and the others are eliminated. Afterwards, when $X$ is anti-absorbingly irreducible, $u(X)$ is the union of all singletons whose indices occur in the remaining parts of $X$; if one digit occurs many times it is taken only once. For convenience, the Matlab$^1$ source code for computing $u(X)$, $X \in D^\Theta$, is provided below. The input variable u_n of this routine corresponds to the DSm base encoding and can be obtained by the method proposed in chapter 2.


    %***************************************************
    function [UX]=GetUX(u_n,X);
    %***************************************************
    % GetUX computes the function u(X) involved
    % in the DSm hybrid rule of combination.
    % Inputs : u_n => Dezert-Smarandache base encoding
    %          X   => Element of D^Theta in base u_n
    % Example for n=3: if Theta={theta1,theta2,theta3}
    %          then u_3=[1 2 12 3 13 23 123]
    % Output : UX  => u(X) expressed in base u_n
    % Copyrights (c) 2003 - J. Dezert & F. Smarandache
    %***************************************************
    % XP holds the Smarandache codes of the parts of X,
    % AF flags the parts that have been anti-absorbed.
    UX=zeros(1,size(u_n,2));XP=u_n(find(X==1))';
    AF=zeros(size(XP,1),1);XC=[];
    for jj=1:size(XP,1)
     if(AF(jj)==0),ujj=num2str(XP(jj));
      for kk=1:size(XP,1)
       if(AF(kk)==0)
        ukk=num2str(XP(kk));w=intersect(ujj,ukk);
        if(isempty(w)==0),
         % one code is contained in the other: keep the shorter one
         if((isequal(w,ujj)+isequal(w,ukk))>0)
          XC=[XC;str2num(w)];
          if(size(ujj,2)<size(ukk,2)),AF(kk)=1;end
          if(size(ukk,2)<size(ujj,2)),AF(jj)=1;end
    end;end;end;end;end;end
    % XCS collects the distinct digits of the retained codes
    XC=unique(XC);XCS=unique(num2str(XC'));
    for ii=1:size(XCS,2),if(XCS(ii)~=' ')
     for jj=1:size(u_n,2)
      if(isempty(intersect(XCS(ii),num2str(u_n(jj))))==0)
       UX(jj)=1;end;end;end;end





Matlab source code for computing $u(X)$, $X \in D^\Theta$

1 Matlab is a trademark of The MathWorks, Inc.


Here are some examples for the case $n = 3$: 12 << 123, i.e. 12 anti-absorbs 123. Between 12 and 23 there is no anti-absorbing relationship.

• If $X = 123$ then $u(X) = \theta_1 \cup \theta_2 \cup \theta_3$.

• If $X = \{23, 123\}$, then 23 << 123, thus $u(\{23, 123\}) = u(23)$, because 123 has been eliminated, hence $u(X) = u(23) = \theta_2 \cup \theta_3$.

• If $X = \{13, 123\}$, then 13 << 123, thus $u(\{13, 123\}) = u(13) = \theta_1 \cup \theta_3$.

• If $X = \{13, 23, 123\}$, then 13 << 123, thus $u(\{13, 23, 123\}) = u(\{13, 23\}) = \theta_1 \cup \theta_2 \cup \theta_3$ (one takes as theta indices each digit in $\{13, 23\}$; if one digit is repeated it is taken only once); between 13 and 23 there is no anti-absorbing relationship.

• If $X = \{3, 13, 23, 123\}$, then $u(X) = u(\{3, 13, 23\})$ because 23 << 123, then $u(\{3, 13, 23\}) = u(\{3, 13\})$ because 3 << 23, then $u(\{3, 13\}) = u(3) = \theta_3$ because 3 << 13.

• If $X = \{1, 12, 13, 23, 123\}$, then one has the anti-absorbing chain 1 << 12 << 123, thus $u(X) = u(\{1, 13, 23\}) = u(\{1, 23\})$ because 1 << 13, and finally $u(X) = \theta_1 \cup \theta_2 \cup \theta_3$.

• If $X = \{1, 2, 12, 13, 23, 123\}$, then 1 << 12 << 123 and 2 << 23, thus $u(X) = u(\{1, 2, 13\}) = u(\{1, 2\})$ because 1 << 13, and finally $u(X) = \theta_1 \cup \theta_2$.

• If $X = \{2, 12, 3, 13, 23, 123\}$, then 2 << 23 << 123 and 3 << 13, thus $u(X) = u(\{2, 12, 3\})$, but 2 << 12, hence $u(X) = u(\{2, 3\}) = \theta_2 \cup \theta_3$.
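
The elimination of anti-absorbed parts described above can also be sketched in a few lines of Python. This is not a translation of the Matlab routine: it assumes that $X$ is given directly as a collection of Smarandache codes written as strings (which limits it to $n \leq 9$), and it returns the sorted list of indices $i$ such that $\theta_i$ enters $u(X)$. The function name `u` and this input format are assumptions made for the illustration.

```python
def u(parts):
    """Indices i of the singletons theta_i whose union is u(X).

    `parts` is an iterable of Smarandache codes such as {"13", "23", "123"}.
    """
    codes = [frozenset(p) for p in parts]
    # A part is anti-absorbed when the digits of another part form a proper
    # subset of its digits; only the irreducible (minimal) parts are kept.
    irreducible = [p for p in codes if not any(q < p for q in codes)]
    digits = set().union(*irreducible) if irreducible else set()
    return sorted(int(d) for d in digits)


examples = {
    "a": u({"123"}),
    "b": u({"23", "123"}),
    "c": u({"3", "13", "23", "123"}),
    "d": u({"1", "12", "13", "23", "123"}),
    "e": u({"1", "2", "12", "13", "23", "123"}),
    "f": u({"2", "12", "3", "13", "23", "123"}),
}
```

Each entry of `examples` corresponds to one of the cases listed above, e.g. `examples["c"]` is `[3]`, i.e. $u(X) = \theta_3$, and `examples["e"]` is `[1, 2]`, i.e. $u(X) = \theta_1 \cup \theta_2$.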


4.5.3 The hybrid DSm rule of combination for 2 sources 


To eliminate the degenerate vacuous fusion problem from the presentation, we assume from now on that the given hybrid DSm model $\mathcal{M}$ under consideration is always different from the vacuous model $\mathcal{M}_\emptyset$ (i.e. $I_t \neq \emptyset$). The hybrid DSm rule of combination, associated with a given hybrid DSm model $\mathcal{M} \neq \mathcal{M}_\emptyset$, for two sources is defined for all $A \in D^\Theta$ as:

$$m_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A) \Big[ \sum_{\substack{X_1, X_2 \in D^\Theta \\ (X_1 \cap X_2) = A}} m_1(X_1) m_2(X_2) + \sum_{\substack{X_1, X_2 \in \boldsymbol{\emptyset} \\ [(u(X_1) \cup u(X_2)) = A] \vee [((u(X_1) \cup u(X_2)) \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} m_1(X_1) m_2(X_2) + \sum_{\substack{X_1, X_2 \in D^\Theta \\ (X_1 \cup X_2) = A \\ X_1 \cap X_2 \in \boldsymbol{\emptyset}}} m_1(X_1) m_2(X_2) \Big] \tag{4.3}$$

The first sum entering in the previous formula corresponds to the mass $m_{\mathcal{M}^f(\Theta)}(A)$ obtained by the classic DSm rule of combination (4.1) based on the free-DSm model $\mathcal{M}^f$ (i.e. on the free lattice $D^\Theta$), i.e.

$$m_{\mathcal{M}^f(\Theta)}(A) \triangleq \sum_{\substack{X_1, X_2 \in D^\Theta \\ (X_1 \cap X_2) = A}} m_1(X_1) m_2(X_2) \tag{4.4}$$

The second sum entering in the formula of the DSm-hybrid rule of combination (4.3) represents the mass of all relatively and absolutely empty sets, which is transferred to the total or relative ignorances. The third sum entering in the formula of the DSm-hybrid rule of combination (4.3) transfers the sum of relatively empty sets to the non-empty sets in a similar way as it was calculated following the DSm classic rule.
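
To make the role of the three sums in (4.3) concrete, here is a minimal Python sketch for two sources. It rests on several assumptions that are ours and not the chapter's: propositions are stored as sets of Venn parts, the model $\mathcal{M}$ is given by the set `empty_parts` of parts forced to be empty, a proposition is (relatively or absolutely) empty when all its parts are forbidden, and propositions that coincide in the model are merged by deleting the forbidden parts. The names `hybrid_dsm`, `u`, `theta` and `all_parts` are hypothetical, and the masses are illustrative.

```python
from collections import defaultdict
from itertools import combinations, product


def all_parts(n):
    """All Venn parts of the free model, each coded by its set of indices."""
    return [frozenset(c) for r in range(1, n + 1)
            for c in combinations(range(1, n + 1), r)]


def theta(i, n):
    return frozenset(p for p in all_parts(n) if i in p)


def u(x, n):
    """u(X): union of the singletons whose indices occur in the irreducible parts."""
    minimal = [p for p in x if not any(q < p for q in x)]
    digits = set().union(*minimal) if minimal else set()
    out = frozenset()
    for i in digits:
        out |= theta(i, n)
    return out


def hybrid_dsm(m1, m2, n, empty_parts):
    empty_parts = frozenset(empty_parts)
    total_ignorance = frozenset(all_parts(n))

    def is_empty(x):
        return x <= empty_parts

    fused = defaultdict(float)
    for (x1, w1), (x2, w2) in product(m1.items(), m2.items()):
        meet = x1 & x2
        if not is_empty(meet):
            target = meet                      # first sum: classic DSm rule
        elif is_empty(x1) and is_empty(x2):
            target = u(x1, n) | u(x2, n)       # second sum: transfer to ignorances
            if is_empty(target):
                target = total_ignorance
        else:
            target = x1 | x2                   # third sum: transfer to X1 u X2
        fused[target - empty_parts] += w1 * w2  # merge propositions equal in M
    return dict(fused)


# Shafer's model on Theta = {theta_1, theta_2}: theta_1 n theta_2 is forced empty.
t1, t2 = theta(1, 2), theta(2, 2)
forbidden = t1 & t2
result = hybrid_dsm({t1: 0.6, t2: 0.4}, {t1: 0.3, t2: 0.7}, 2, forbidden)
```

In this assumed example the conflicting mass $0.6 \times 0.7 + 0.4 \times 0.3 = 0.54$, which the classic rule assigns to $\theta_1 \cap \theta_2$, is transferred by the third sum to $\theta_1 \cup \theta_2$, while $\theta_1$ and $\theta_2$ keep 0.18 and 0.28; the total mass remains equal to one.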
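4.5.4 The hybrid DSm rule of combination for $k \geq 2$ sources


The previous formula of the hybrid DSm rule of combination can be generalized in the following way for all $A \in D^\Theta$:

$$m_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A) \Big[ \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \prod_{i=1}^{k} m_i(X_i) + \sum_{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [(u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)) = A] \vee [((u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)) \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \prod_{i=1}^{k} m_i(X_i) + \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ X_1 \cap X_2 \cap \ldots \cap X_k \in \boldsymbol{\emptyset}}} \prod_{i=1}^{k} m_i(X_i) \Big] \tag{4.5}$$

The first sum entering in the previous formula corresponds to the mass $m_{\mathcal{M}^f(\Theta)}(A)$ obtained by the classic DSm rule of combination (4.2) for $k$ sources of information based on the free-DSm model $\mathcal{M}^f$ (i.e. on the free lattice $D^\Theta$), i.e.

$$m_{\mathcal{M}^f(\Theta)}(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \prod_{i=1}^{k} m_i(X_i) \tag{4.6}$$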


4.5.5 On the associativity of the hybrid DSm rule 


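From (4.5) and (4.6), the previous general formula can be rewritten as

$$m_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A) \Big[ S_1(A) + S_2(A) + S_3(A) \Big] \tag{4.7}$$

where

$$S_1(A) \equiv m_{\mathcal{M}^f(\Theta)}(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \prod_{i=1}^{k} m_i(X_i) \tag{4.8}$$

$$S_2(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [(u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)) = A] \vee [((u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)) \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \prod_{i=1}^{k} m_i(X_i) \tag{4.9}$$

$$S_3(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ X_1 \cap X_2 \cap \ldots \cap X_k \in \boldsymbol{\emptyset}}} \prod_{i=1}^{k} m_i(X_i) \tag{4.10}$$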


This rule of combination can actually be viewed as a two-step procedure as follows:

• Step 1: Evaluate the combination of the sources over the free lattice $D^\Theta$ by the classical DSm rule of combination to get, for all $A \in D^\Theta$, $S_1(A) = m_{\mathcal{M}^f(\Theta)}(A)$ using (4.6). This step preserves the commutativity and associativity properties of the combination. When there is no constraint (when using the free DSm model), the hybrid DSm rule reduces to the classic DSm rule because $\boldsymbol{\emptyset} = \{\emptyset\}$ and $m_i(\emptyset) = 0$, $i = 1, \ldots, k$, and therefore $\phi(A) = 1$ and $S_2(A) = S_3(A) = 0$ $\forall A \neq \emptyset \in D^\Theta$. For $A = \emptyset$, $\phi(A) = 0$ and thus $m_{\mathcal{M}^f}(\emptyset) = 0$.

• Step 2: Transfer the masses of the integrity constraints of the hybrid DSm model $\mathcal{M}$ according to formula (4.7). Note that this step is necessary only if one has reliable information about the real integrity constraints involved in the fusion problem under consideration. More precisely, when some constraints are introduced to deal with a given hybrid DSm model $\mathcal{M}(\Theta)$, there exist some propositions $A \overset{\mathcal{M}}{\equiv} \emptyset$ for which $\phi(A) = 0$. For these propositions, it is actually not necessary to compute $S_1(A)$, $S_2(A)$ and $S_3(A)$ since the product $\phi(A)[S_1(A) + S_2(A) + S_3(A)]$ equals zero because $\phi(A) = 0$. This reduces the cost of computations. For propositions $A \overset{\mathcal{M}}{\not\equiv} \emptyset$ characterized by $\phi(A) = 1$, the derivation of $S_1(A)$, $S_2(A)$ and $S_3(A)$ is necessary to get $m_{\mathcal{M}(\Theta)}(A)$. The last part of the hybrid DSm combination mechanism (called the compression step) consists in gathering (summing) all masses corresponding to the same proposition because of the constraints of the model. As an example, if one considers the 3D frame $\Theta = \{\theta_1, \theta_2, \theta_3\}$ with the constraint $\theta_2 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$, then the mass $m_{\mathcal{M}(\Theta)}(\theta_1 \cup (\theta_2 \cap \theta_3))$ resulting from the hybrid DSm fusion rule (4.7) will have to be added to $m_{\mathcal{M}(\Theta)}(\theta_1)$ because $\theta_1 \cup (\theta_2 \cap \theta_3) \overset{\mathcal{M}}{\equiv} \theta_1$ due to the constraint $\theta_2 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$.


The second step does not preserve the full associativity of the rule (the same remark also applies to Yager's or Dubois & Prade's rules), but this is not a fundamental requirement because this problem can be easily circumvented by keeping in parallel the two previous steps 1 and 2. The fusion always has to start on the free-DSm model. The second step is applied only when some integrity constraints are introduced and before the decision-making. In other words, if one has only 2 independent sources of information giving $m_1(.)$ and $m_2(.)$ and some integrity constraints on the frame $\Theta$, one applies step 1 to get$^2$ $m^{1,2}_{\mathcal{M}^f(\Theta)}(.) = [m_1 \oplus m_2](.)$ defined on the free-DSm model, and then one applies step 2 to get the final result $m^{1,2}_{\mathcal{M}(\Theta)}(.)$ on the hybrid model. If a third source of information is introduced, say $m_3(.)$, one combines it with the two previous ones by step 1 again to get $m^{1,2,3}_{\mathcal{M}^f(\Theta)}(.) = [m_3 \oplus m^{1,2}_{\mathcal{M}^f(\Theta)}](.)$, and then one applies step 2 to get the final result $m^{1,2,3}_{\mathcal{M}(\Theta)}(.)$ on the hybrid model $\mathcal{M}(\Theta)$.


There is no technical difficulty in processing the fusion in this way, which is why the full associativity of the fusion rule is not so fundamental, despite all the criticisms of the alternatives to Dempster's rule that have emerged in the literature over the years. The full/direct associativity property is realized only through Dempster's rule of combination when working on Shafer's model. This is one of the reasons why Dempster's rule is usually preferred to the other fusion rules, but in turn this associativity property (obtained through the normalization factor $1 - m(\emptyset)$) has also been one of the main sources of criticism for more than twenty years, because one knows that Dempster's rule fails to provide coherent results when conflicts become high (see chapters [5] and [12] for examples) and something else must be carried out anyway to prevent problems. This state of affairs is quite paradoxical.


2 We introduce here the notation $m^{1,2}(.)$ to explicitly express that the resulting mass is related to the combination of sources 1 and 2 only.

To avoid the loss of information in the fusion, one first has to combine all sources using the DSm rule on the free-DSm model and then adapt the belief masses according to the integrity constraints of the model $\mathcal{M}$. If one first adapts the local masses $m_1(.), \ldots, m_k(.)$ to the hybrid-model $\mathcal{M}$ and applies the combination rule afterwards, the fusion becomes only suboptimal because some information is lost forever during the transfer of the masses of the integrity constraints. The same remark holds if the transfer of the masses of the integrity constraints is done at some intermediate step after the fusion of $m$ sources with $m < k$.


Let's note also that this transfer formula is more general (because it allows both exclusivity constraints and non-existential constraints to be introduced) and more precise (because all the different relative emptiness of elements is explicitly considered in the general transfer formula (4.7)) than the generic transfer formulas used in the DST framework and proposed as alternative rules to Dempster's rule of combination [6], which are discussed in section 4.5.10.


4.5.6 Property of the hybrid DSm Rule 


The following equality holds:

$$\sum_{A \in D^\Theta} m_{\mathcal{M}(\Theta)}(A) = \sum_{A \in D^\Theta} \phi(A)\,[S_1(A) + S_2(A) + S_3(A)] = 1 \qquad (4.11)$$

Proof: Let's first prove that $\sum_{A \in D^\Theta} m(A) = 1$ where all masses $m(A)$ are obtained by the DSm classic rule. Let's consider each mass $m_i(.)$ provided by the $i$th source of information, for $1 \leq i \leq k$, as a vector of dimension $d = |D^\Theta|$ whose sum of components is equal to one, i.e. $m_i(D^\Theta) = [m_{i1}, m_{i2}, \ldots, m_{id}]$ and $\sum_{j=1,d} m_{ij} = 1$. Thus, for $k \geq 2$ sources of information, the mass matrix becomes

$$\mathbf{M} = \begin{bmatrix} m_{11} & m_{12} & \ldots & m_{1d} \\ \vdots & \vdots & \ddots & \vdots \\ m_{k1} & m_{k2} & \ldots & m_{kd} \end{bmatrix}$$

If one denotes the sets in $D^\Theta$ by $A_1, A_2, \ldots, A_d$ (it doesn't matter in what order one lists them) then the column $(j)$ in the matrix represents the masses assigned to $A_j$ by each source of information $s_1, s_2, \ldots, s_k$; for example $s_i(A_j) = m_{ij}$, where $1 \leq i \leq k$. According to the DSm network architecture [3], all the products in this network will have the form $m_{1j_1} m_{2j_2} \ldots m_{kj_k}$, i.e. one element only from each matrix row, and no restriction on the number of elements from each matrix column, $1 \leq j_1, j_2, \ldots, j_k \leq d$. Each such product will enter in the fusion mass of one set only from $D^\Theta$. Hence the sum of all components of the fusion mass is equal to the sum of all these products, which is equal to

$$\prod_{i=1}^{k} \sum_{j=1}^{d} m_{ij} = \prod_{i=1}^{k} 1 = 1 \qquad (4.12)$$
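
The counting argument behind (4.12) can be checked numerically. The following is a minimal sketch, assuming a small hypothetical mass matrix with randomly drawn, normalized rows (the sizes `k` and `d` and the helper names are illustrative and not part of the original text): it enumerates every product that takes one entry from each row and sums them.

```python
import itertools
import math
import random


def sum_of_network_products(mass_matrix):
    """Sum every product that takes exactly one entry from each row."""
    return sum(math.prod(choice) for choice in itertools.product(*mass_matrix))


def random_mass_row(d, rng):
    """Hypothetical source: d non-negative masses normalized to sum to one."""
    raw = [rng.random() for _ in range(d)]
    total = sum(raw)
    return [x / total for x in raw]


rng = random.Random(0)
k, d = 3, 5  # illustrative sizes only; d = |D^Theta| is much larger in practice
mass_matrix = [random_mass_row(d, rng) for _ in range(k)]

# d**k products in total, one per choice (j_1, ..., j_k) of columns
total_mass = sum_of_network_products(mass_matrix)
print(d ** k, "products, total mass =", total_mass)
```

Up to floating-point rounding the total equals 1, because the sum of all such products factorizes into the product of the row sums.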

The hybrid DSm rule has three sums $S_1$, $S_2$, and $S_3$. Let's separate the mass matrix $\mathbf{M}$ into two disjoint sub-matrices: $\mathbf{M}_\emptyset$, formed by the columns of all absolutely and relatively empty sets, and $\mathbf{M}_N$, formed by the columns of all non-empty sets. According to the DSm network architecture (for $k \geq 2$ rows):

• $S_1$ is the sum of all products resulting from the multiplications of the columns of $\mathbf{M}_N$ following the DSm network architecture such that the intersection of their corresponding sets is non-empty, i.e. the sum of masses of all non-empty sets before any mass of absolutely or relatively empty sets could be transferred to them;

• $S_2$ is the sum of all products resulting from the multiplications of $\mathbf{M}_\emptyset$ following the DSm network architecture, i.e. a partial sum of masses of absolutely and relatively empty sets transferred to the ignorances in $I \triangleq I_t \cup I_r$ or to singletons of $\Theta$;

• $S_3$ is the sum of all the products resulting from the multiplications of the columns of $\mathbf{M}_N$ and $\mathbf{M}_\emptyset$ together, following the DSm network architecture, but such that at least one column is from each of them, and also the sum of all products of columns of $\mathbf{M}_N$ such that the intersection of their corresponding sets is empty (which did not enter into the previous sum $S_1$), i.e. the remaining sum of masses of absolutely or relatively empty sets transferred to the non-empty sets of the hybrid DSm model $\mathcal{M}$.


If one now considers all the terms (each such term is a product of the form $m_{1j_1} m_{2j_2} \ldots m_{kj_k}$) of these three sums, one gets exactly the same terms as in the DSm network architecture for the DSm classic rule, thus the sum of all terms occurring in $S_1$, $S_2$, and $S_3$ is 1 (see formula (4.12)), which completes the proof. The hybrid DSm rule naturally derives from the DSm classic rule. The entire masses of relatively and absolutely empty sets in a given hybrid DSm model $\mathcal{M}$ are transferred to non-empty sets according to formula (4.7) and thus

$$\forall A \in \boldsymbol{\emptyset} \subset D^\Theta, \quad m_{\mathcal{M}(\Theta)}(A) = 0 \qquad (4.13)$$

The entire mass of a relatively empty set (from $D^\Theta$) which has in its expression $\theta_{j_1}, \theta_{j_2}, \ldots, \theta_{j_r}$, with $1 \leq r \leq n$, will generally be distributed among the $\theta_{j_1}, \theta_{j_2}, \ldots, \theta_{j_r}$ or their unions or intersections, and the distribution follows the way of multiplication from the DSm classic rule, explained by the DSm network architecture [3]. Thus, because nothing is lost and nothing is gained, the sum of all $m_{\mathcal{M}(\Theta)}(A)$ is equal to 1 as just proven, and fortunately no normalization constant, which could bring a loss of information in the fusion rule, is needed. The three summations $S_1(.)$, $S_2(.)$ and $S_3(.)$ are disjoint because:


• $S_1(.)$ multiplies the columns corresponding to non-empty sets only, but such that the intersections of the sets corresponding to these columns are non-empty [from the definition of the DSm classic rule];

• $S_2(.)$ multiplies the columns corresponding to absolutely and relatively empty sets only;

• $S_3(.)$ multiplies:

a) either the columns corresponding to absolutely or relatively empty sets with the columns corresponding to non-empty sets, such that at least one column corresponds to an absolutely or relatively empty set and at least one column corresponds to a non-empty set,

b) or the columns corresponding to non-empty sets, but such that the intersections of the sets corresponding to these columns are empty.

The multiplications follow the DSm network architecture, i.e. any product has the above general form $m_{1j_1} m_{2j_2} \ldots m_{kj_k}$: it contains as factor one element only from each row of the mass matrix $\mathbf{M}$, and the total number of factors in a product is equal to $k$. The function $\phi(A)$ automatically assigns the value zero to the mass of any empty set, and allows the calculation of the masses of all non-empty sets.


4.5.7 On the programming of the hybrid DSm rule 


We briefly give here an approach for a fast programming of the DSm rule of combination. Let's consider $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$, the sources $B_1, B_2, \ldots, B_k$, and $p = \min\{n, k\}$. One needs to check only the focal sets, i.e. the sets (propositions) whose masses assigned by these sources are not all zero. Thus, if $\mathbf{M}$ is the mass matrix and we consider a set $A_j$ in $D^\Theta$, then the column $(j)$ corresponding to $A_j$, i.e. $(m_{1j}\ m_{2j}\ \ldots\ m_{kj})'$, must not be identical to the null vector of dimension $k$, $(0\ 0\ \ldots\ 0)'$. Let $D^\Theta(\text{step}_1)$ be formed by all focal sets at the beginning (after the sources $B_1, B_2, \ldots, B_k$ have assigned masses to the sets in $D^\Theta$). Applying the DSm classic rule, besides the sets in $D^\Theta(\text{step}_1)$ one adds the $r$-intersections of sets in $D^\Theta(\text{step}_1)$, thus:

$$D^\Theta(\text{step}_2) = D^\Theta(\text{step}_1) \vee \{A_{i_1} \wedge A_{i_2} \wedge \ldots \wedge A_{i_r}\}$$

where $A_{i_1}, A_{i_2}, \ldots, A_{i_r}$ belong to $D^\Theta(\text{step}_1)$ and $2 \leq r \leq p$.

Applying the hybrid DSm rule, due to its $S_2$ and $S_3$ summations, besides the sets in $D^\Theta(\text{step}_2)$ one adds the $r$-unions of sets in $D^\Theta(\text{step}_2)$ and the total ignorance, thus:

$$D^\Theta(\text{step}_3) = D^\Theta(\text{step}_2) \vee I_t \vee \{A_{i_1} \vee A_{i_2} \vee \ldots \vee A_{i_r}\}$$

where $A_{i_1}, A_{i_2}, \ldots, A_{i_r}$ belong to $D^\Theta(\text{step}_2)$ and $2 \leq r \leq p$.

This means that instead of computing the masses of all sets in $D^\Theta$, one needs to first compute the masses of all focal sets (step 1), second the masses of their $r$-intersections (step 2), and third the masses of the $r$-unions of all previous sets and the mass of the total ignorance (step 3).
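
The three steps above can be sketched as follows. This is only an illustration under stated assumptions: each element of $D^\Theta$ is represented by the set of Venn-diagram parts it covers in the free model (a part is a non-empty subset of the indices $\{1, \ldots, n\}$), so that $\cap$ and $\cup$ become ordinary set operations. The function names `theta` and `candidate_sets` and the choice of focal sets are hypothetical.

```python
from itertools import combinations


def theta(i, n):
    """Venn-diagram parts covered by theta_i in the free model:
    every non-empty subset of {1..n} that contains i."""
    return frozenset(
        frozenset(c)
        for r in range(1, n + 1)
        for c in combinations(range(1, n + 1), r)
        if i in c
    )


def candidate_sets(focal_sets_per_source, n):
    """Return the sets of steps 1, 2 and 3 for k sources on a frame of size n."""
    k = len(focal_sets_per_source)
    p = min(n, k)
    # Step 1: all focal sets of all sources
    step1 = set().union(*focal_sets_per_source)
    # Step 2: add the r-intersections of step-1 sets, 2 <= r <= p
    step2 = set(step1)
    for r in range(2, p + 1):
        for combo in combinations(step1, r):
            step2.add(frozenset.intersection(*combo))
    # Step 3: add the total ignorance and the r-unions of step-2 sets
    total_ignorance = frozenset.union(*(theta(i, n) for i in range(1, n + 1)))
    step3 = set(step2) | {total_ignorance}
    for r in range(2, p + 1):
        for combo in combinations(step2, r):
            step3.add(frozenset.union(*combo))
    return step1, step2, step3


# Hypothetical focal sets: source 1 on {theta_1, theta_3}, source 2 on {theta_2, theta_3}
n = 3
source_1 = {theta(1, n), theta(3, n)}
source_2 = {theta(2, n), theta(3, n)}
step1, step2, step3 = candidate_sets([source_1, source_2], n)
print(len(step1), len(step2), len(step3))
```

With these focal sets the three steps contain 3, 6 and 16 sets respectively, against the 19 elements of $D^\Theta$ for $n = 3$; the saving grows quickly with $n$ when the sources have few focal sets.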

4.5.8 Application of the hybrid DSm rule on previous examples


We present in this section some numerical results of the hybrid DSm rule of combination for 2 independent sources of information. We examine the seven previous examples in order to help the reader check the validity of our new general formula for himself (or herself). We will not go into the details of the derivations; we just present the main intermediary results $S_1(A)$, $S_2(A)$ and $S_3(A)$ (defined in (4.8), (4.9), (4.10)) involved in the general formula (4.3), with the number of sources to combine set to $k = 2$. Now let's consider $\Theta = \{\theta_1, \theta_2, \theta_3\}$ and two independent bodies of evidence with the generalized basic belief assignments³ $m_1(.)$ and $m_2(.)$ given in the following table⁴:


[Table: $m_1(A)$, $m_2(A)$ and $m_{\mathcal{M}^f(\Theta)}(A)$ for each element $A$ of $D^\Theta$]

The right column of the table gives the result obtained by the DSm rule of combination based on the free-DSm model. The following sections give the results obtained by the hybrid DSm rule on the seven previous examples of section 4.3. The tables show the values of $\phi(A)$, $S_1(A)$, $S_2(A)$ and $S_3(A)$ to help the reader check the validity of these results. It is important to note that the values of $S_1(A)$, $S_2(A)$ and $S_3(A)$ when $\phi(A) = 0$ do not need to be computed in practice but are provided here only for verification.


3 A general example with $m_1(A) > 0$ and $m_2(A) > 0$ for all $A \neq \emptyset \in D^\Theta$ will be briefly presented in the next section.
4 The order of the elements of $D^\Theta$ is the order obtained from the generation of isotone Boolean functions - see chapter [2].


80 CHAPTER 4. COMBINATION OF BELIEFS ON HYBRID DSM MODELS 


4.5.8.1 Application of the hybrid DSm rule on example 1


Here is the numerical result corresponding to example 1 with the hybrid-model $\mathcal{M}_1$ (i.e. with the exclusivity constraint $\theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_1}{\equiv} \emptyset$). The right column of the table provides the result obtained using the hybrid DSm rule, i.e. $\forall A \in D^\Theta$, $m_{\mathcal{M}_1(\Theta)}(A) = \phi(A)\,[S_1(A) + S_2(A) + S_3(A)]$.














[Table: $\phi(A)$, $S_1(A)$, $S_2(A)$, $S_3(A)$ and $m_{\mathcal{M}_1(\Theta)}(A)$ for each element $A$ of $D^\Theta$]

$D_{\mathcal{M}_1}$ =
0 0 0 0 0 0
0 0 0 0 0 1
0 0 0 0 1 0
0 0 0 0 1 1
0 0 0 1 1 1
0 0 1 0 0 0
0 0 1 0 0 1
0 0 1 0 1 0
0 0 1 0 1 1
0 0 1 1 1 1
0 1 1 0 0 1
0 1 1 0 1 1
0 1 1 1 1 1
1 0 1 0 1 0
1 0 1 0 1 1
1 0 1 1 1 1
1 1 1 0 1 1
1 1 1 1 1 1





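From the previous table of this first numerical example, we see in the column corresponding to $S_3(A)$ how the initial combined mass $m_{\mathcal{M}^f(\Theta)}(\theta_1 \cap \theta_2 \cap \theta_3) \equiv S_1(\theta_1 \cap \theta_2 \cap \theta_3) = 0.16$ is transferred (due to the constraint of $\mathcal{M}_1$) only onto the elements $(\theta_1 \cup \theta_2) \cap \theta_3$, $(\theta_1 \cup \theta_3) \cap \theta_2$, $(\theta_2 \cup \theta_3) \cap \theta_1$, $(\theta_1 \cap \theta_2) \cup \theta_3$, $(\theta_1 \cap \theta_3) \cup \theta_2$, and $(\theta_2 \cap \theta_3) \cup \theta_1$ of $D^\Theta$. We can easily check that the sum of the elements of the column for $S_3(A)$ is equal to $m_{\mathcal{M}^f(\Theta)}(\theta_1 \cap \theta_2 \cap \theta_3) = 0.16$ (i.e. to the sum of $S_1(A)$ for which $\phi(A) = 0$) and that the sum of $S_2(A)$ for which $\phi(A) = 1$ is equal to the sum of $S_3(A)$ for which $\phi(A) = 0$ (in this example the sum is zero). Thus after introducing the constraint, the initial hyper-power set $D^\Theta$ reduces to 18 elements as follows

$D^\Theta_{\mathcal{M}_1} = \{\emptyset,\ \theta_2 \cap \theta_3,\ \theta_1 \cap \theta_3,\ (\theta_1 \cup \theta_2) \cap \theta_3,\ \theta_3,\ \theta_1 \cap \theta_2,\ (\theta_1 \cup \theta_3) \cap \theta_2,\ (\theta_2 \cup \theta_3) \cap \theta_1,\ \{(\theta_1 \cap \theta_2) \cup \theta_3\} \cap (\theta_1 \cup \theta_2),$
$(\theta_1 \cap \theta_2) \cup \theta_3,\ \theta_2,\ (\theta_1 \cap \theta_3) \cup \theta_2,\ \theta_2 \cup \theta_3,\ \theta_1,\ (\theta_2 \cap \theta_3) \cup \theta_1,\ \theta_1 \cup \theta_3,\ \theta_1 \cup \theta_2,\ \theta_1 \cup \theta_2 \cup \theta_3\}$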

As detailed in chapter [2], the elements of $D^\Theta_{\mathcal{M}_1}$ can be described and encoded by the matrix product $D_{\mathcal{M}_1} \cdot u_{\mathcal{M}_1}$ with $D_{\mathcal{M}_1}$ given above and the basis vector $u_{\mathcal{M}_1}$ defined⁵ as $u_{\mathcal{M}_1} = [<1>\ <2>\ <12>\ <3>\ <13>\ <23>]'$. Actually $u_{\mathcal{M}_1}$ is directly obtained from $u_{\mathcal{M}^f}$ by removing its component $<123>$ corresponding to the constraint introduced by the model $\mathcal{M}_1$. In general, the encoding matrix $D_{\mathcal{M}}$ for a given hybrid DSm model $\mathcal{M}$ is obtained from $D_{\mathcal{M}^f}$ by removing all its columns corresponding to the constraints of the chosen model $\mathcal{M}$ and all the rows corresponding to redundant/equivalent propositions. In this particular example with model $\mathcal{M}_1$, we just have to remove the last column of $D_{\mathcal{M}^f}$ to get $D_{\mathcal{M}_1}$, and no row is removed from $D_{\mathcal{M}^f}$ because there is no redundant/equivalent proposition involved in this example. This suppression of some rows of $D_{\mathcal{M}^f}$ will however occur in the next examples.
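
The column-removal and row-merging procedure can be illustrated with a short sketch. It assumes the Venn-part encoding of the basis vector ($<1>, <2>, <12>, <3>, <13>, <23>, <123>$), generates $D^\Theta$ for $n = 3$ by closing $\{\theta_1, \theta_2, \theta_3\}$ under $\cup$ and $\cap$ and adding $\emptyset$, then drops the basis components forced to be empty and merges identical rows. The names `encoding_matrix` and `models`, and the convention of counting the $\emptyset$ row in the free matrix, are choices made for this illustration only.

```python
N = 3
# Venn-diagram parts in the order of the basis vector used in the text
BASIS_FREE = [frozenset(s) for s in
              [(1,), (2,), (1, 2), (3,), (1, 3), (2, 3), (1, 2, 3)]]


def theta(i):
    """Parts covered by theta_i in the free model."""
    return frozenset(p for p in BASIS_FREE if i in p)


def free_hyper_power_set():
    """Close {theta_1..theta_N} under union and intersection, then add the empty set."""
    elems = {theta(i) for i in range(1, N + 1)}
    while True:
        new = {a | b for a in elems for b in elems}
        new |= {a & b for a in elems for b in elems}
        if new <= elems:
            return elems | {frozenset()}
        elems |= new


def encoding_matrix(forced_empty=frozenset()):
    """Drop the columns of the parts forced to be empty, then merge equal rows."""
    basis = [p for p in BASIS_FREE if p not in forced_empty]
    rows = {tuple(int(p in a) for p in basis) for a in free_hyper_power_set()}
    return basis, sorted(rows)


t1, t2, t3 = theta(1), theta(2), theta(3)
# Element forced to be empty by each model of examples 1 to 7
models = {
    "Mf": frozenset(),
    "M1": t1 & t2 & t3,
    "M2": t1 & t2,
    "M3": (t1 | t3) & t2,
    "M4": (t1 & t2) | (t1 & t3) | (t2 & t3),  # Shafer's model
    "M5": t1,
    "M6": t1 | t2,
    "M7": (t1 & t2) | t3,
}
sizes = {name: len(encoding_matrix(c)[1]) for name, c in models.items()}
print(sizes)
```

The row counts obtained this way (19 for the free model including $\emptyset$, then 18, 13, 10, 8, 5, 2 and 4) agree with the sizes of the reduced hyper-power sets quoted for examples 1 to 7.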


4.5.8.2 Application of the hybrid DSm rule on example 2 


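Here is the numerical result corresponding to example 2 with the hybrid-model $\mathcal{M}_2$ (i.e. with the exclusivity constraint $\theta_1 \cap \theta_2 \overset{\mathcal{M}_2}{\equiv} \emptyset \Rightarrow \theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_2}{\equiv} \emptyset$). One gets now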


[Table: $\phi(A)$, $S_1(A)$, $S_2(A)$, $S_3(A)$ and $m_{\mathcal{M}_2(\Theta)}(A)$ for each element $A$ of $D^\Theta$]

5 $D_{\mathcal{M}^f}$ was denoted $D_n$ and $u_{\mathcal{M}^f}$ as $u_n$ in chapter [2].

From the previous table of this numerical example, we see in the column corresponding to $S_3(A)$ how the initial combined masses $m_{\mathcal{M}^f(\Theta)}(\theta_1 \cap \theta_2 \cap \theta_3) \equiv S_1(\theta_1 \cap \theta_2 \cap \theta_3) = 0.16$ and $m_{\mathcal{M}^f(\Theta)}(\theta_1 \cap \theta_2) \equiv S_1(\theta_1 \cap \theta_2) = 0.22$ are transferred (due to the constraint of $\mathcal{M}_2$) onto some elements of $D^\Theta$. We can easily check that the sum of the elements of the column for $S_3(A)$ is equal to 0.16 + 0.22 = 0.38 (i.e. to the sum of $S_1(A)$ for which $\phi(A) = 0$) and that the sum of $S_2(A)$ for which $\phi(A) = 1$ is equal to the sum of $S_3(A)$ for which $\phi(A) = 0$ (this sum is 0.02). Because some elements of $D^\Theta$ are now equivalent due to the constraints of $\mathcal{M}_2$, we have to sum all the masses corresponding to the same equivalent propositions/elements (for example $\{(\theta_1 \cap \theta_2) \cup \theta_3\} \cap (\theta_1 \cup \theta_2) \overset{\mathcal{M}_2}{\equiv} (\theta_1 \cup \theta_2) \cap \theta_3$). This can be viewed as the final compression step. One then gets the reduced hyper-power set $D^\Theta_{\mathcal{M}_2}$, now having 13 different elements, with the combined belief masses presented in the following table.

The basis vector $u_{\mathcal{M}_2}$ and the encoding matrix $D_{\mathcal{M}_2}$ for the elements of $D^\Theta_{\mathcal{M}_2}$ are given by $u_{\mathcal{M}_2} = [<1>\ <2>\ <3>\ <13>\ <23>]'$ and below. Actually $u_{\mathcal{M}_2}$ is directly obtained from $u_{\mathcal{M}^f}$ by removing its components $<12>$ and $<123>$ corresponding to the constraints introduced by the model $\mathcal{M}_2$.


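Element $A$ of $D^\Theta_{\mathcal{M}_2}$ | $m_{\mathcal{M}_2(\Theta)}(A)$ | row of $D_{\mathcal{M}_2}$
$\emptyset$ | 0 | 0 0 0 0 0
$\theta_2 \cap \theta_3$ | 0.19 + 0.07 = 0.26 | 0 0 0 0 1
$\theta_1 \cap \theta_3$ | 0.12 + 0.02 = 0.14 | 0 0 0 1 0
$(\theta_1 \cup \theta_2) \cap \theta_3$ | 0.03 + 0 = 0.03 | 0 0 0 1 1
$\theta_3$ | 0.10 + 0.07 = 0.17 | 0 0 1 1 1
$\theta_2$ | 0.08 | 0 1 0 0 1
$(\theta_1 \cap \theta_3) \cup \theta_2$ | 0.01 | 0 1 0 1 1
$\theta_2 \cup \theta_3$ | 0 | 0 1 1 1 1
$\theta_1$ | … | 1 0 0 1 0
$(\theta_2 \cap \theta_3) \cup \theta_1$ | … | 1 0 0 1 1
$\theta_1 \cup \theta_3$ | … | 1 0 1 1 1
$\theta_1 \cup \theta_2$ | … | 1 1 0 1 1
$\theta_1 \cup \theta_2 \cup \theta_3$ | … | 1 1 1 1 1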
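
4.5.8.3 Application of the hybrid DSm rule on example 3


Here is the numerical result corresponding to example 3 with the hybrid-model $\mathcal{M}_3$ (i.e. with the exclusivity constraint $(\theta_1 \cup \theta_3) \cap \theta_2 \overset{\mathcal{M}_3}{\equiv} \emptyset$). This constraint directly implies $\theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_3}{\equiv} \emptyset$, $\theta_1 \cap \theta_2 \overset{\mathcal{M}_3}{\equiv} \emptyset$ and $\theta_2 \cap \theta_3 \overset{\mathcal{M}_3}{\equiv} \emptyset$. One gets now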


[Table: $\phi(A)$, $S_1(A)$, $S_2(A)$, $S_3(A)$ and $m_{\mathcal{M}_3(\Theta)}(A)$ for each element $A$ of $D^\Theta$]


We see in the column corresponding to $S_3(A)$ how the initial combined masses $m_{\mathcal{M}^f(\Theta)}((\theta_1 \cup \theta_3) \cap \theta_2) \equiv S_1((\theta_1 \cup \theta_3) \cap \theta_2) = 0.05$, $m_{\mathcal{M}^f(\Theta)}(\theta_1 \cap \theta_2 \cap \theta_3) \equiv S_1(\theta_1 \cap \theta_2 \cap \theta_3) = 0.16$, $m_{\mathcal{M}^f(\Theta)}(\theta_2 \cap \theta_3) \equiv S_1(\theta_2 \cap \theta_3) = 0.19$ and $m_{\mathcal{M}^f(\Theta)}(\theta_1 \cap \theta_2) \equiv S_1(\theta_1 \cap \theta_2) = 0.22$ are transferred (due to the constraint of $\mathcal{M}_3$) onto some elements of $D^\Theta$. We can easily check that the sum of the elements of the column for $S_3(A)$ is equal to 0.05 + 0.16 + 0.19 + 0.22 = 0.62 (i.e. to the sum of $S_1(A)$ for which $\phi(A) = 0$) and that the sum of $S_2(A)$ for which $\phi(A) = 1$ is equal to 0.02 + 0.02 = 0.04 (i.e. to the sum of $S_3(A)$ for which $\phi(A) = 0$). Due to the model $\mathcal{M}_3$, one has to sum all the masses corresponding to the same equivalent propositions. Thus after the final compression step, one gets the reduced hyper-power set $D^\Theta_{\mathcal{M}_3}$ having only 10 different elements with the following combined belief masses. The basis vector $u_{\mathcal{M}_3}$ is given by $u_{\mathcal{M}_3} = [<1>\ <2>\ <3>\ <13>]'$ and the encoding matrix $D_{\mathcal{M}_3}$ is shown right after.
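
Element $A$ of $D^\Theta_{\mathcal{M}_3}$ | $m_{\mathcal{M}_3(\Theta)}(A)$ | row of $D_{\mathcal{M}_3}$
$\emptyset$ | 0 | 0 0 0 0
$\theta_1 \cap \theta_3$ | 0.12 + 0.03 + 0.02 + 0 = 0.17 | 0 0 0 1
$\theta_3$ | 0.16 + 0.07 = 0.23 | 0 0 1 1
$\theta_2$ | 0.12 | 0 1 0 0
$(\theta_1 \cap \theta_3) \cup \theta_2$ | 0.01 | 0 1 0 1
$\theta_2 \cup \theta_3$ | 0.05 | 0 1 1 1
$\theta_1$ | 0.12 + 0.04 = 0.16 | 1 0 0 1
$\theta_1 \cup \theta_3$ | 0.08 | 1 0 1 1
$\theta_1 \cup \theta_2$ | 0.11 | 1 1 0 1
$\theta_1 \cup \theta_2 \cup \theta_3$ | 0.07 | 1 1 1 1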


4.5.8.4 Application of the hybrid DSm rule on example 4 (Shafer’s model) 


Here is the result obtained with the hybrid-model $\mathcal{M}_4$, i.e. Shafer's model.

[Table: $\phi(A)$, $S_1(A)$, $S_2(A)$, $S_3(A)$ and $m_{\mathcal{M}_4(\Theta)}(A)$ for each element $A$ of $D^\Theta$]

From the previous table of this numerical example, we see in the column corresponding to $S_3(A)$ how the initial combined masses of the eight elements forced to the empty set by the constraints of the model $\mathcal{M}_4$ are transferred onto some elements of $D^\Theta$. We can easily check that the sum of the elements of the column for $S_3(A)$ is equal to 0.16 + 0.19 + 0.12 + 0.01 + 0.22 + 0.05 = 0.75 (i.e. to the sum of $S_1(A)$ for which $\phi(A) = 0$) and that the sum of $S_2(A)$ for which $\phi(A) = 1$ is equal to the sum of $S_3(A)$ for which $\phi(A) = 0$ (this sum is 0.02 + 0.06 = 0.08 = 0.02 + 0.02 + 0.02 + 0.02).

After the final compression step (i.e. the clustering of all equivalent propositions), one gets the reduced hyper-power set $D^\Theta_{\mathcal{M}_4}$ having only $2^3 = 8$ elements (corresponding to the classical power set $2^\Theta$) with the following combined belief masses:


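Element $A$ of $D^\Theta_{\mathcal{M}_4}$ | $m_{\mathcal{M}_4(\Theta)}(A)$ | row of $D_{\mathcal{M}_4}$
$\emptyset$ | 0 | 0 0 0
$\theta_3$ | … + 0.07 = 0.24 | 0 0 1
$\theta_2$ | … + 0.01 = 0.13 | 0 1 0
$\theta_2 \cup \theta_3$ | 0.05 | 0 1 1
$\theta_1$ | … + 0.04 = 0.18 | 1 0 0
$\theta_1 \cup \theta_3$ | 0.17 | 1 0 1
$\theta_1 \cup \theta_2$ | 0.11 | 1 1 0
$\theta_1 \cup \theta_2 \cup \theta_3$ | 0.12 | 1 1 1

The basis vector $u_{\mathcal{M}_4}$ is given by $u_{\mathcal{M}_4} = [<1>\ <2>\ <3>]'$ and the encoding matrix $D_{\mathcal{M}_4}$ is shown just above.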


4.5.8.5 Application of the hybrid DSm rule on example 5 


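The following table presents the numerical result corresponding to example 5 with the hybrid-model $\mathcal{M}_5$ including the non-existential constraint $\theta_1 \overset{\mathcal{M}_5}{\equiv} \emptyset$. This non-existential constraint implies $\theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_5}{\equiv} \emptyset$, $\theta_1 \cap \theta_2 \overset{\mathcal{M}_5}{\equiv} \emptyset$, $\theta_1 \cap \theta_3 \overset{\mathcal{M}_5}{\equiv} \emptyset$ and $(\theta_2 \cup \theta_3) \cap \theta_1 \overset{\mathcal{M}_5}{\equiv} \emptyset$.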


From the table, we see in the column corresponding to $S_3(A)$ how the initial combined masses of the 5 elements forced to the empty set by the constraints of the model $\mathcal{M}_5$ are transferred onto some elements of $D^\Theta$. We can easily check that the sum of the elements of the column for $S_3(A)$ is equal to 0 + 0.16 + 0.12 + 0.22 + 0 + 0.08 = 0.58 (i.e. to the sum of $S_1(A)$ for which $\phi(A) = 0$) and that the sum of $S_2(A)$ for which $\phi(A) = 1$ is equal to the sum of $S_3(A)$ for which $\phi(A) = 0$ (this sum is 0.02 + 0.06 + 0.04 = 0.12 = 0.02 + 0.02 + 0.08).

[Table: $\phi(A)$, $S_1(A)$, $S_2(A)$, $S_3(A)$ and $m_{\mathcal{M}_5(\Theta)}(A)$ for each element $A$ of $D^\Theta$]


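After the final compression step (i.e. the clustering of all equivalent propositions), one gets the reduced hyper-power set $D^\Theta_{\mathcal{M}_5}$ having only 5 different elements according to:

Element $A$ of $D^\Theta_{\mathcal{M}_5}$ | $m_{\mathcal{M}_5(\Theta)}(A)$ | row of $D_{\mathcal{M}_5}$
$\emptyset$ | 0 | 0 0 0
$\theta_2 \cap \theta_3$ | 0.19 + 0.03 + 0.07 + 0 + 0.04 = 0.33 | 0 0 1
$\theta_3$ | 0.11 + 0.07 + 0.21 = 0.39 | 0 1 1
$\theta_2$ | 0.08 + 0.01 + 0.15 = 0.24 | 1 0 1
$\theta_2 \cup \theta_3$ | 0 + 0.04 = 0.04 | 1 1 1

The basis vector $u_{\mathcal{M}_5}$ is given by $u_{\mathcal{M}_5} = [<2>\ <3>\ <23>]'$ and the encoding matrix $D_{\mathcal{M}_5}$ is shown just above.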

4.5.8.6 Application of the hybrid DSm rule on example 6


Here is the numerical result corresponding to example 6 with the hybrid-model $\mathcal{M}_6$ including the two non-existential constraints $\theta_1 \overset{\mathcal{M}_6}{\equiv} \emptyset$ and $\theta_2 \overset{\mathcal{M}_6}{\equiv} \emptyset$. This is actually a degenerate example, since no uncertainty arises in such a trivial model. We just want to show here that the hybrid DSm rule still works in this example and provides a legitimate result. By applying the hybrid DSm rule of combination, one now gets:


[Table: $\phi(A)$, $S_1(A)$, $S_2(A)$, $S_3(A)$ and $m_{\mathcal{M}_6(\Theta)}(A)$ for each element $A$ of $D^\Theta$]


We can still verify that the sum of $S_3(A)$ (i.e. 0.88) equals the sum of $S_1(A)$ for which $\phi(A) = 0$ and that the sum of $S_2(A)$ for which $\phi(A) = 1$ (i.e. 0.42) equals the sum of $S_3(A)$ for which $\phi(A) = 0$. After the clustering of all equivalent propositions, one gets the reduced hyper-power set $D^\Theta_{\mathcal{M}_6}$ having only 2 different elements according to:

Element $A$ of $D^\Theta_{\mathcal{M}_6}$ | $m_{\mathcal{M}_6(\Theta)}(A)$
$\emptyset$ | 0
$\theta_3$ | 0.17 + 0.07 + 0.09 + 0.23 + 0.44 = 1

The encoding matrix $D_{\mathcal{M}_6}$ and the basis vector $u_{\mathcal{M}_6}$ for the elements of $D^\Theta_{\mathcal{M}_6}$ reduce to $D_{\mathcal{M}_6} = [0\ 1]'$ and $u_{\mathcal{M}_6} = [<3>]$.
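
4.5.8.7 Application of the hybrid DSm rule on example 7


Here is the numerical result corresponding to example 7 with the hybrid-model $\mathcal{M}_7$ including the mixed exclusivity and non-existential constraint $(\theta_1 \cap \theta_2) \cup \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$. This mixed constraint implies $\theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\theta_1 \cap \theta_2 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\theta_1 \cap \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\theta_2 \cap \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $(\theta_1 \cup \theta_2) \cap \theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $(\theta_1 \cup \theta_3) \cap \theta_2 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $(\theta_2 \cup \theta_3) \cap \theta_1 \overset{\mathcal{M}_7}{\equiv} \emptyset$, $\{(\theta_1 \cap \theta_2) \cup \theta_3\} \cap (\theta_1 \cup \theta_2) \overset{\mathcal{M}_7}{\equiv} \emptyset$ and $\theta_3 \overset{\mathcal{M}_7}{\equiv} \emptyset$. By applying the hybrid DSm rule of combination, one gets:

[Table: $\phi(A)$, $S_1(A)$, $S_2(A)$, $S_3(A)$ and $m_{\mathcal{M}_7(\Theta)}(A)$ for each element $A$ of $D^\Theta$]
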


After the clustering of all equivalent propositions, one gets the reduced hyper-power set $D^\Theta_{\mathcal{M}_7}$ having only 4 different elements according to:

Element $A$ of $D^\Theta_{\mathcal{M}_7}$ | $m_{\mathcal{M}_7(\Theta)}(A)$ | row of $D_{\mathcal{M}_7}$
$\emptyset$ | 0 | 0 0
$\theta_2$ | … + 0.11 = 0.24 | 0 1
$\theta_1$ | … + 0.25 = 0.43 | 1 0
$\theta_1 \cup \theta_2$ | … + 0.22 = 0.33 | 1 1

The basis vector $u_{\mathcal{M}_7}$ for the elements of $D^\Theta_{\mathcal{M}_7}$ is given by $u_{\mathcal{M}_7} = [<1>\ <2>]'$ and the encoding matrix $D_{\mathcal{M}_7}$ is shown just above.

We can still verify that the sum of $S_3(A)$ (i.e. 0.85) equals the sum of $S_1(A)$ for which $\phi(A) = 0$ and that the sum of $S_2(A)$ for which $\phi(A) = 1$ (i.e. 0.25) equals the sum of $S_3(A)$ for which $\phi(A) = 0$.


4.5.9 Example with more general basic belief assignments $m_1(.)$ and $m_2(.)$


We present in this section the numerical results of the hybrid DSm rule of combination applied to the seven previous models $\mathcal{M}_i$, $i = 1, \ldots, 7$, with two general basic belief assignments $m_1(.)$ and $m_2(.)$ such that $m_1(A) > 0$ and $m_2(A) > 0$ for all $A \neq \emptyset \in D^{\Theta = \{\theta_1, \theta_2, \theta_3\}}$. We just provide the results here; the verification is left to the reader. The following table presents the numerical values chosen for $m_1(.)$ and $m_2(.)$ and the result of the fusion obtained by the classical DSm rule of combination.


[Table: $m_1(A)$, $m_2(A)$ and $m_{\mathcal{M}^f(\Theta)}(A)$ for each element $A$ of $D^\Theta$]

The following table shows the results obtained by the hybrid DSm rule before the final compression step of all redundant propositions for the hybrid DSm models presented in the previous examples.


Element A of DÈ 

0 0 

01 N b2 N 03 0 

02 N 03 0.0573 
01 N 03 0.0621 


(0, U 02) N 03 0.0324 


0.0435 
0 

0 0.0365 

0 0.0719 

(0, N 03) U (02 N 63) 0.0704 

0 0.0613 


oc0u0 0 Ou €0C0 0 COCO. € OCO—, VO O. O 


0.0207 

















b2 0.0309 
0.1346 
0.0175 


oc0óuúu0, ,  €0 0 CO O GO OOO 0 O Oo O O O O 


(02 N 03) U 01 0.0229 
U b 0.0385 








0 
01 U 02 0.0412 
01 U 02 U 03 0.2583 


The next tables present the final results of the hybrid DSm rule of combination after the compression 


step (the merging of all equivalent redundant propositions) presented in previous examples. 


[Tables: final masses $m_{\mathcal{M}_i(\Theta)}(A)$ on the reduced hyper-power sets $D^\Theta_{\mathcal{M}_7}$, $D^\Theta_{\mathcal{M}_6}$ and $D^\Theta_{\mathcal{M}_5}$ (examples no. 7, 6 and 5), $D^\Theta_{\mathcal{M}_4}$ and $D^\Theta_{\mathcal{M}_3}$ (examples no. 4 and 3), and $D^\Theta_{\mathcal{M}_2}$ and $D^\Theta_{\mathcal{M}_1}$ (examples no. 2 and 1)]

4.5.10 The hybrid DSm rule versus Dempster's rule of combination


In its essence, the hybrid DSm rule of combination is close to Dubois and Prade's rule of combination (see chapter [] and [4]) but more general and precise because it works on $D^\Theta \supset 2^\Theta$ and allows us to include all possible exclusivity and non-existential constraints for the model one has to work with. The advantage of using the hybrid DSm rule is that it requires neither the calculation of weighting factors nor a normalization. The hybrid DSm rule of combination is definitely not equivalent to Dempster's rule of combination, as one can easily show with the following very simple example:


Let's consider $\Theta = \{\theta_1, \theta_2\}$ and two sources in full contradiction providing the following basic belief assignments

$m_1(\theta_1) = 1 \qquad m_1(\theta_2) = 0$
$m_2(\theta_1) = 0 \qquad m_2(\theta_2) = 1$

Using the classic DSm rule of combination working with the free DSm model $\mathcal{M}^f$, one gets

$m_{\mathcal{M}^f}(\theta_1) = 0 \qquad m_{\mathcal{M}^f}(\theta_2) = 0 \qquad m_{\mathcal{M}^f}(\theta_1 \cap \theta_2) = 1 \qquad m_{\mathcal{M}^f}(\theta_1 \cup \theta_2) = 0$

If one forces $\theta_1$ and $\theta_2$ to be exclusive in order to work with Shafer's model $\mathcal{M}^0$, then Dempster's rule of combination cannot be applied in this limit case because of the full contradiction of the two sources of information: one gets the undefined operation 0/0. But the hybrid DSm rule can be applied in such a limit case because it transfers the mass of this empty set ($\theta_1 \cap \theta_2 \equiv \emptyset$ because of the choice of the model $\mathcal{M}^0$) to non-empty set(s), and one gets:

$m_{\mathcal{M}^0}(\theta_1) = 0 \qquad m_{\mathcal{M}^0}(\theta_2) = 0 \qquad m_{\mathcal{M}^0}(\theta_1 \cap \theta_2) = 0 \qquad m_{\mathcal{M}^0}(\theta_1 \cup \theta_2) = 1$


This result is coherent in this very simple case with Yager’s and Dubois-Prade’s rule of combination [DE]. 


Now let's examine the behavior of the numerical result when introducing a small variation $\epsilon > 0$ on the initial basic belief assignments $m_1(.)$ and $m_2(.)$ as follows:

$m_1(\theta_1) = 1 - \epsilon \qquad m_1(\theta_2) = \epsilon \qquad \text{and} \qquad m_2(\theta_1) = \epsilon \qquad m_2(\theta_2) = 1 - \epsilon$

As shown in figure 4.2, $\lim_{\epsilon \to 0} m_{DS}(.)$, where $m_{DS}(.)$ is the result obtained from Dempster's rule of combination, is given by

$m_{DS}(\theta_1) = 0.5 \qquad m_{DS}(\theta_2) = 0.5 \qquad m_{DS}(\theta_1 \cap \theta_2) = 0 \qquad m_{DS}(\theta_1 \cup \theta_2) = 0$

This result is very questionable because it assigns the same belief to $\theta_1$ and $\theta_2$, which is more informational than assigning all the belief to the total ignorance. The assignment of the belief to the total ignorance appears to be more justified from our point of view because it properly reflects the almost total contradiction between the two sources, and in such cases it seems legitimate that no information can be drawn from the fusion. When we apply the hybrid DSm rule of combination (using Shafer's model $\mathcal{M}^0$), one gets in the limit $\epsilon \to 0$ the expected belief assignment on the total ignorance, i.e. $m_{\mathcal{M}^0}(\theta_1 \cup \theta_2) = 1$. The figure below shows the evolution of the belief assignments on $\theta_1$, $\theta_2$ and $\theta_1 \cup \theta_2$ with $\epsilon$ obtained with the classical Dempster rule and the hybrid DSm rule based on Shafer's model $\mathcal{M}^0$ (i.e. $\theta_1 \cap \theta_2 \overset{\mathcal{M}^0}{\equiv} \emptyset$).
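
The comparison can be reproduced with a few lines of code. The sketch below makes simplifying assumptions that are not part of the original text: it works directly on Shafer's model for $\Theta = \{\theta_1, \theta_2\}$, represents propositions as subsets of $\{1, 2\}$, and implements only the $S_1$ and $S_3$ terms of the hybrid rule, which is sufficient here because every focal element is non-empty. The function names are hypothetical.

```python
T1, T2 = frozenset({1}), frozenset({2})


def conjunctive(m1, m2):
    """Unnormalized conjunctive combination; the empty set collects the conflict."""
    out = {}
    for a, x in m1.items():
        for b, y in m2.items():
            out[a & b] = out.get(a & b, 0.0) + x * y
    return out


def dempster(m1, m2):
    """Dempster's rule: conjunctive combination followed by normalization."""
    joint = conjunctive(m1, m2)
    conflict = joint.pop(frozenset(), 0.0)
    if conflict >= 1.0:
        raise ZeroDivisionError("total conflict: Dempster's rule gives 0/0")
    return {a: v / (1.0 - conflict) for a, v in joint.items()}


def hybrid_dsm_shafer(m1, m2):
    """S1 + S3 terms only: a conflicting product goes to the union of its sets.
    Assumes that no focal element is itself empty (so S2 is not needed)."""
    out = {}
    for a, x in m1.items():
        for b, y in m2.items():
            key = (a & b) or (a | b)
            out[key] = out.get(key, 0.0) + x * y
    return out


eps = 1e-3
m1 = {T1: 1.0 - eps, T2: eps}
m2 = {T1: eps, T2: 1.0 - eps}

m_ds = dempster(m1, m2)
m_hybrid = hybrid_dsm_shafer(m1, m2)
print("Dempster:  ", m_ds[T1], m_ds[T2])
print("hybrid DSm:", m_hybrid[T1], m_hybrid[T2], m_hybrid[T1 | T2])
```

For any $0 < \epsilon < 1$ Dempster's rule returns 0.5 for both $\theta_1$ and $\theta_2$, whereas this sketch of the hybrid rule assigns $(1-\epsilon)^2 + \epsilon^2$ to $\theta_1 \cup \theta_2$, which tends to 1 as $\epsilon \to 0$. With $\epsilon = 0$ the `dempster` function raises an error, mirroring the undefined 0/0 case.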


[Figure 4.2, three panels: evolution of $m(\theta_1)$, $m(\theta_2)$ and $m(\theta_1 \cup \theta_2)$ with $\epsilon$, each comparing the Dempster rule and the DSm hybrid rule]

Figure 4.2: Comparison of Dempster's rule with the hybrid DSm rule on $\Theta = \{\theta_1, \theta_2\}$


4.6 Dynamic fusion 


The hybrid DSm rule of combination presented in this paper has been developed for static problems, but it is also directly applicable to dynamic fusion problems in real time, since at each temporal change of the models one can still apply such a hybrid rule. If $D^\Theta$ changes, due to the dynamicity of the frame $\Theta$, from time $t_l$ to time $t_{l+1}$, i.e. some of its elements which at time $t_l$ were not empty become (or are proven) empty at time $t_{l+1}$, or vice versa, if new elements, empty at time $t_l$, arise non-empty at time $t_{l+1}$, this hybrid DSm rule can be applied again at each change. If $\Theta$ stays the same but the set of non-empty elements of $D^\Theta$ increases, then one again applies the hybrid DSm rule.


4.6.1 Example 1

Let's consider the testimony fusion problem⁶ with the frame

$\Theta(t_l) \triangleq \{\theta_1 \equiv \text{young},\ \theta_2 \equiv \text{old},\ \theta_3 \equiv \text{white hairs}\}$

with the following two basic belief assignments

$m_1(\theta_1) = 0.5 \qquad m_1(\theta_3) = 0.5 \qquad \text{and} \qquad m_2(\theta_2) = 0.5 \qquad m_2(\theta_3) = 0.5$


6 This problem has been proposed to the authors in a private communication by L. Cholvy in 2002.
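
By applying the classical DSm fusion rule, one then gets

$m_{\mathcal{M}^f(\Theta(t_l))}(\theta_1 \cap \theta_2) = 0.25 \qquad m_{\mathcal{M}^f(\Theta(t_l))}(\theta_1 \cap \theta_3) = 0.25$

$m_{\mathcal{M}^f(\Theta(t_l))}(\theta_2 \cap \theta_3) = 0.25 \qquad m_{\mathcal{M}^f(\Theta(t_l))}(\theta_3) = 0.25$

Suppose now that at time $t_{l+1}$ one knows that young people don't have white hairs (i.e. $\theta_1 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$). How can we update the previous fusion result with this new information on the model of the problem? We solve it with the hybrid DSm rule, which transfers the mass of the empty sets (imposed by the constraints on the new model $\mathcal{M}$ available at time $t_{l+1}$) to the non-empty sets of $D^\Theta$, going on the track of the DSm classic rule. Using the hybrid DSm rule with the constraint $\theta_1 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$, one then gets:

$m_{\mathcal{M}}(\theta_1 \cap \theta_2) = 0.25 \qquad m_{\mathcal{M}}(\theta_2 \cap \theta_3) = 0.25 \qquad m_{\mathcal{M}}(\theta_3) = 0.25$

and the mass $m_{\mathcal{M}}(\theta_1 \cap \theta_3) = 0$, because $\theta_1 \cap \theta_3 = \{\text{young}\} \cap \{\text{white hairs}\} \overset{\mathcal{M}}{\equiv} \emptyset$ and its previous mass $m_{\mathcal{M}^f(\Theta(t_l))}(\theta_1 \cap \theta_3) = 0.25$ is transferred to $m_{\mathcal{M}}(\theta_1 \cup \theta_3) = 0.25$ by the hybrid DSm rule.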
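
This example can be replayed with a small sketch. It assumes the Venn-part representation of $D^\Theta$ for $n = 3$ (each element is the set of parts it covers), models the constraint by deleting the parts forced to be empty, and implements only the $S_1$ and $S_3$ terms of the two-source hybrid rule, which is enough here because no focal element is empty in the constrained model. The function names are hypothetical, not the authors' implementation.

```python
from itertools import combinations

N = 3
PARTS = [frozenset(c) for r in range(1, N + 1)
         for c in combinations(range(1, N + 1), r)]


def theta(i):
    """Venn-diagram parts covered by theta_i in the free model."""
    return frozenset(p for p in PARTS if i in p)


def classic_dsm(m1, m2):
    """Classic DSm rule on the free model: mass goes to the intersection."""
    out = {}
    for a, x in m1.items():
        for b, y in m2.items():
            out[a & b] = out.get(a & b, 0.0) + x * y
    return out


def hybrid_dsm(m1, m2, forced_empty):
    """S1 + S3 terms of the hybrid rule under a constraint given as a set of parts.
    Raises if a focal element is empty in the model, since S2 is not implemented."""
    out = {}
    for a, x in m1.items():
        for b, y in m2.items():
            a_m, b_m = a - forced_empty, b - forced_empty
            if not a_m or not b_m:
                raise ValueError("focal element empty in this model: S2 term needed")
            key = (a_m & b_m) or (a_m | b_m)
            out[key] = out.get(key, 0.0) + x * y
    return out


young, old, white = theta(1), theta(2), theta(3)
m1 = {young: 0.5, white: 0.5}
m2 = {old: 0.5, white: 0.5}

free_result = classic_dsm(m1, m2)
constraint = young & white  # young people don't have white hairs
hybrid_result = hybrid_dsm(m1, m2, constraint)

print(free_result[young & white])                 # mass before the constraint
print(hybrid_result[(young | white) - constraint])  # mass after the transfer
```

Under these assumptions the mass 0.25 carried by $\theta_1 \cap \theta_3$ on the free model reappears on $\theta_1 \cup \theta_3$ once the constraint is introduced, while the three other masses are unchanged.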


4.6.2 Example 2 


Let $\Theta(t_l) \triangleq \{\theta_1, \theta_2, \ldots, \theta_n\}$ be a list of suspects and let's consider two observers who eyewitness the scene of plunder at a museum in Baghdad and who testify on radio and TV to the identities of the thieves using the basic belief assignments $m_1(.)$ and $m_2(.)$ defined on $D^{\Theta(t_l)}$, where $t_l$ represents the time of the observation. Afterwards, at time $t_{l+1}$, one finds out that one suspect among this list $\Theta(t_l)$, say $\theta_i$, could not be a suspect because he was on duty in another place, evidence which was certainly confirmed. Therefore he has to be taken off the suspect list $\Theta(t_l)$, and a new frame of discernment $\Theta(t_{l+1})$ results. If this one changes again, one applies again the hybrid DSm rule of combination of evidence, and so on. This is a typical dynamical example where models change with time and where one needs to adapt the fusion results to the current model over time. In the meantime, one can also take into account new observations/testimonies in the hybrid DSm fusion rule as soon as they become available to the fusion system.

If $\Theta$ (and therefore $D^\Theta$) diminishes (i.e. some of their elements are proven to be empty sets) from time $t_l$ to time $t_{l+1}$, then one applies the hybrid DSm rule in order to transfer the masses of empty sets to the non-empty sets (in the way of the DSm classic rule), getting an updated basic belief assignment $m_{t_{l+1}|t_l}(.)$. Conversely, if $\Theta$ and $D^\Theta$ increase (i.e. new elements arise in $\Theta$, and/or new elements in $D^\Theta$ are proven different from the empty set and as a consequence a basic belief assignment for them is required), then new masses (from the same or from other sources of information) are needed to describe these new elements, and again one combines them using the hybrid DSm rule.
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4.6.3 Example 3 


Let's consider a fusion problem at time $t_l$ characterized by the frame $\Theta(t_l) = \{\theta_1, \theta_2\}$ and two independent sources of information providing the basic belief assignments $m_1(.)$ and $m_2(.)$ over $D^{\Theta(t_l)}$, and assume that at time $t_{l+1}$ a new hypothesis $\theta_3$ is introduced into the previous frame $\Theta(t_l)$ and that a third source of evidence available at time $t_{l+1}$ provides its own basic belief assignment $m_3(.)$ over $D^{\Theta(t_{l+1})}$, where

$$\Theta(t_{l+1}) \triangleq \{\Theta(t_l), \theta_3\} \equiv \{\theta_1, \theta_2, \theta_3\}$$
To solve this kind of dynamical fusion problem, we just use the classical DSm fusion rule as follows:

• combine $m_1(.)$ and $m_2(.)$ at time $t_l$ using the classical DSm fusion rule to get $m_{12}(.) = [m_1 \oplus m_2](.)$ over $D^{\Theta(t_l)}$

• because $D^{\Theta(t_l)} \subset D^{\Theta(t_{l+1})}$, $m_{12}(.)$ assigns the combined basic belief on a subset of $D^{\Theta(t_{l+1})}$, so it is still directly possible to combine $m_{12}(.)$ with $m_3(.)$ at time $t_{l+1}$ by the classical DSm fusion rule to get the final result $m_{123}(.)$ over $D^{\Theta(t_{l+1})}$ given by

$$m_{t_{l+1}}(.) \triangleq m_{123}(.) = [m_{12} \oplus m_3](.) = [(m_1 \oplus m_2) \oplus m_3](.) \equiv [m_1 \oplus m_2 \oplus m_3](.)$$

• finally, apply the hybrid DSm rule if some integrity constraints have to be taken into account in the model $\mathcal{M}$ of the problem


This method can be directly generalized to any number of sources of evidence and, in theory, to any structure/dimension of the frames $\Theta(t_l)$, $\Theta(t_{l+1})$, ... In practice, however, due to the huge number of elements of hyper-power sets, the dimension of the frames $\Theta(t_l)$, $\Theta(t_{l+1})$, ... must not be too large. This practical limitation depends on the computer resources available for real-time processing. Specific suboptimal implementations of the DSm rule will have to be developed to deal with fusion problems of large dimension.
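To give a feel for how quickly hyper-power sets grow, the following minimal Python sketch enumerates $D^\Theta$ for small $n$ under the free DSm model. It relies on an encoding that is an assumption of this sketch and not taken from the text: each element is stored as the set of Venn-diagram parts it covers, so that $\cup$ and $\cap$ become ordinary set operations. The function names `singleton` and `hyper_power_set` are illustrative.

```python
from itertools import combinations

def singleton(i, n):
    """theta_i encoded as the set of Venn-diagram parts that lie inside it.

    A part is named by the non-empty subset of {1..n} of the singletons
    that overlap there (an assumed encoding, chosen for this sketch).
    """
    idx = range(1, n + 1)
    parts = (frozenset(c) for r in idx for c in combinations(idx, r))
    return frozenset(p for p in parts if i in p)

def hyper_power_set(n):
    """Close {theta_1..theta_n} under union and intersection, then add the empty set."""
    elems = {singleton(i, n) for i in range(1, n + 1)}
    while True:
        new = {a | b for a in elems for b in elems}
        new |= {a & b for a in elems for b in elems}
        if new <= elems:          # nothing new: closure reached
            return elems | {frozenset()}
        elems |= new

sizes = [len(hyper_power_set(n)) for n in range(1, 5)]
print(sizes)
```

For $n = 1, \ldots, 4$ this brute-force closure yields 2, 5, 19 and 167 elements; it is only practical for very small frames, which is the limitation the paragraph above describes.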


It is also important to point out here that DSmT can easily deal not only with dynamical fusion problems but also with decentralized fusion problems working on non-exhaustive frames. For example, let's consider a set of two independent sources of information providing the basic belief assignments $m_1(.)$ and $m_2(.)$ over $D^{\Theta_{12}(t_l) = \{\theta_1, \theta_2\}}$ and another group of three independent sources of information providing the basic belief assignments $m_3(.)$, $m_4(.)$ and $m_5(.)$ over $D^{\Theta_{345}(t_l) = \{\theta_3, \theta_4, \theta_5, \theta_6\}}$; then it is still possible to combine all information in a decentralized manner as follows:

• combine $m_1(.)$ and $m_2(.)$ at time $t_l$ using the classical DSm fusion rule to get $m_{12}(.) = [m_1 \oplus m_2](.)$ over $D^{\Theta_{12}(t_l)}$.

• combine $m_3(.)$, $m_4(.)$ and $m_5(.)$ at time $t_l$ using the classical DSm fusion rule to get $m_{345}(.) = [m_3 \oplus m_4 \oplus m_5](.)$ over $D^{\Theta_{345}(t_l)}$.

• consider now the global frame $\Theta(t_l) \triangleq \{\Theta_{12}(t_l), \Theta_{345}(t_l)\}$.

• finally, apply the hybrid DSm rule if some integrity constraints have to be taken into account in the model $\mathcal{M}$ of the problem.


Note that this static decentralized fusion can also be extended to decentralized dynamical fusion by mixing the two previous approaches.

One can even combine all five masses together by extending the vectors $m_i(.)$, $1 \le i \le 5$, with null components for the new elements arising from enlarging $\Theta$ to $\{\theta_1, \theta_2, \theta_3, \theta_4, \theta_5\}$ and correspondingly enlarging $D^\Theta$, and using the hybrid DSm rule for $k = 5$. More generally, one can combine the masses of any $k \ge 2$ sources.

We now give several simple numerical examples for such dynamical fusion problems involving non-exclusive frames.


4.6.3.1 Example 3.1 


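Let's consider $\Theta(t_l) \triangleq \{\theta_1, \theta_2\}$ and the two following basic belief assignments available at time $t_l$:

$$m_1(\theta_1) = 0.1 \quad m_1(\theta_2) = 0.2 \quad m_1(\theta_1 \cup \theta_2) = 0.3 \quad m_1(\theta_1 \cap \theta_2) = 0.4$$
$$m_2(\theta_1) = 0.5 \quad m_2(\theta_2) = 0.3 \quad m_2(\theta_1 \cup \theta_2) = 0.1 \quad m_2(\theta_1 \cap \theta_2) = 0.1$$

The classical DSm rule of combination gives

$$m_{12}(\theta_1) = 0.21 \quad m_{12}(\theta_2) = 0.17 \quad m_{12}(\theta_1 \cup \theta_2) = 0.03 \quad m_{12}(\theta_1 \cap \theta_2) = 0.59$$

Now let's consider at time $t_{l+1}$ the frame $\Theta(t_{l+1}) \triangleq \{\theta_1, \theta_2, \theta_3\}$ and a third source of evidence with the following basic belief assignment

$$m_3(\theta_3) = 0.4 \quad m_3(\theta_1 \cap \theta_3) = 0.3 \quad m_3(\theta_2 \cup \theta_3) = 0.3$$

Then the final result of the fusion is obtained by combining $m_3(.)$ with $m_{12}(.)$ by the classical DSm rule of combination. One thus obtains:

$$m_{123}(\theta_1 \cap \theta_2 \cap \theta_3) = 0.464 \quad m_{123}(\theta_2 \cap \theta_3) = 0.068 \quad m_{123}(\theta_1 \cap \theta_3) = 0.156 \quad m_{123}((\theta_1 \cup \theta_2) \cap \theta_3) = 0.012$$
$$m_{123}(\theta_1 \cap \theta_2) = 0.177 \quad m_{123}(\theta_1 \cap (\theta_2 \cup \theta_3)) = 0.063 \quad m_{123}(\theta_2) = 0.051 \quad m_{123}((\theta_1 \cap \theta_3) \cup \theta_2) = 0.009$$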
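The numbers of Example 3.1 can be checked with a short Python sketch of the classical DSm rule. It is only an illustration, not the authors' implementation: it assumes that each element of $D^\Theta$ is encoded as the set of Venn-diagram parts it covers (so `&` and `|` on `frozenset`s play the roles of $\cap$ and $\cup$), and the names `theta` and `dsm_classic` are hypothetical.

```python
from itertools import combinations, product
from collections import defaultdict

N = 3  # work directly in the enlarged frame {theta_1, theta_2, theta_3}

def theta(i, n=N):
    """theta_i as the set of Venn-diagram parts it covers (assumed encoding)."""
    idx = range(1, n + 1)
    parts = (frozenset(c) for r in idx for c in combinations(idx, r))
    return frozenset(p for p in parts if i in p)

def dsm_classic(m_a, m_b):
    """Classical DSm rule: each product of masses goes to the intersection."""
    out = defaultdict(float)
    for (x, wx), (y, wy) in product(m_a.items(), m_b.items()):
        out[x & y] += wx * wy
    return dict(out)

t1, t2, t3 = theta(1), theta(2), theta(3)
m1 = {t1: 0.1, t2: 0.2, t1 | t2: 0.3, t1 & t2: 0.4}
m2 = {t1: 0.5, t2: 0.3, t1 | t2: 0.1, t1 & t2: 0.1}
m3 = {t3: 0.4, t1 & t3: 0.3, t2 | t3: 0.3}

m12 = dsm_classic(m1, m2)      # fusion at time t_l
m123 = dsm_classic(m12, m3)    # fusion at time t_{l+1}

print(round(m12[t1 & t2], 3))        # mass of theta_1 ∩ theta_2 at time t_l
print(round(m123[t1 & t2 & t3], 3))  # mass of theta_1 ∩ theta_2 ∩ theta_3 at t_{l+1}
```

Because $D^{\Theta(t_l)} \subset D^{\Theta(t_{l+1})}$, the sketch encodes $m_1$ and $m_2$ directly in the three-element frame, which is what allows `m12` to be combined with `m3` without any conversion.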
4.6.3.2 Example 3.2


Let's consider $\Theta(t_l) \triangleq \{\theta_1, \theta_2\}$ and the two previous basic belief assignments $m_1(.)$ and $m_2(.)$ available at time $t_l$. The classical DSm fusion rule gives, as before,

$$m_{12}(\theta_1) = 0.21 \quad m_{12}(\theta_2) = 0.17 \quad m_{12}(\theta_1 \cup \theta_2) = 0.03 \quad m_{12}(\theta_1 \cap \theta_2) = 0.59$$

Now let's consider at time $t_{l+1}$ the frame $\Theta(t_{l+1}) \triangleq \{\theta_1, \theta_2, \theta_3\}$ and the third source of evidence as in the previous example, with the basic belief assignment

$$m_3(\theta_3) = 0.4 \quad m_3(\theta_1 \cap \theta_3) = 0.3 \quad m_3(\theta_2 \cup \theta_3) = 0.3$$

The final result of the fusion obtained by the classical DSm rule of combination corresponds to the result of the previous example, but suppose now one finds out that the integrity constraint $\theta_3 = \emptyset$ holds, which also implies the constraints $\theta_1 \cap \theta_2 \cap \theta_3 = \emptyset$, $\theta_1 \cap \theta_3 = \emptyset$, $\theta_2 \cap \theta_3 = \emptyset$ and $(\theta_1 \cup \theta_2) \cap \theta_3 = \emptyset$. This is the hybrid DSm model $\mathcal{M}$ under consideration here. We then have to readjust the mass $m_{123}(.)$ of the previous example by the hybrid DSm rule, and one finally gets

$$m_{\mathcal{M}}(\theta_1) = 0.147 \qquad m_{\mathcal{M}}(\theta_2) = 0.060 + 0.119 = 0.179$$
$$m_{\mathcal{M}}(\theta_1 \cup \theta_2) = 0 + 0 + 0.021 = 0.021 \qquad m_{\mathcal{M}}(\theta_1 \cap \theta_2) = 0.240 + 0.413 = 0.653$$

Therefore, when we impose $\theta_3 = \emptyset$ again and apply the hybrid DSm rule, we don’t get back the same result (i.e. $m_{\mathcal{M}}(.) \neq m_{12}(.)$) because some information from $m_3(.)$ still remains on $\theta_1$, $\theta_2$, $\theta_1 \cup \theta_2$, or $\theta_1 \cap \theta_2$, i.e. $m_3(\theta_2) = 0.3 > 0$.
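The readjustment of Example 3.2 can be reproduced with a two-source Python sketch of the hybrid DSm rule. This is a hedged illustration under stated assumptions, not the authors' code: elements are encoded as sets of Venn-diagram parts, a model is given by the set `empty` of parts forced to be empty, and the three branches follow one reading of the sums $S_1$, $S_2$ and $S_3$ of the hybrid rule. The names `all_parts`, `theta`, `u` and `dsm_hybrid` are hypothetical.

```python
from itertools import combinations, product
from collections import defaultdict

N = 3

def all_parts(n=N):
    """Every part of the Venn diagram on n singletons (assumed encoding)."""
    idx = range(1, n + 1)
    return frozenset(frozenset(c) for r in idx for c in combinations(idx, r))

def theta(i, n=N):
    return frozenset(p for p in all_parts(n) if i in p)

def u(x, n=N):
    """Union of the singletons that compose x (read off its minimal parts)."""
    minimal = [p for p in x if not any(q < p for q in x)]
    out = frozenset()
    for p in minimal:
        for i in p:
            out |= theta(i, n)
    return out

def dsm_hybrid(m_a, m_b, empty, n=N):
    """Two-source hybrid DSm rule; `empty` holds the parts forced to be empty.

    Inputs must be written in free-model form; output keys are reduced by the model.
    """
    out = defaultdict(float)
    for (x, wx), (y, wy) in product(m_a.items(), m_b.items()):
        target = (x & y) - empty                      # S1: non-empty intersection
        if not target:
            if not (x - empty) and not (y - empty):   # S2: both operands are empty
                target = ((u(x, n) | u(y, n)) - empty) or (all_parts(n) - empty)
            else:                                     # S3: only the intersection is empty
                target = (x | y) - empty
        out[target] += wx * wy
    return dict(out)

t1, t2, t3 = theta(1), theta(2), theta(3)
m12 = {t1: 0.21, t2: 0.17, t1 | t2: 0.03, t1 & t2: 0.59}
m3 = {t3: 0.4, t1 & t3: 0.3, t2 | t3: 0.3}

empty = t3                       # constraint theta_3 = empty set
mM = dsm_hybrid(m12, m3, empty)
for name, elem in [("t1", t1), ("t2", t2), ("t1|t2", t1 | t2), ("t1&t2", t1 & t2)]:
    print(name, round(mM[elem - empty], 3))
```

Masses are looked up with `elem - empty` because, once the model is imposed, each element is identified with what remains of it after the empty parts are removed.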


4.6.3.3 Example 3.3 


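Let's consider $\Theta(t_l) \triangleq \{\theta_1, \theta_2\}$ and the two previous basic belief assignments $m_1(.)$ and $m_2(.)$ available at time $t_l$. The classical DSm fusion rule gives, as before,

$$m_{12}(\theta_1) = 0.21 \quad m_{12}(\theta_2) = 0.17 \quad m_{12}(\theta_1 \cup \theta_2) = 0.03 \quad m_{12}(\theta_1 \cap \theta_2) = 0.59$$

Now let's consider at time $t_{l+1}$ the frame $\Theta(t_{l+1}) \triangleq \{\theta_1, \theta_2, \theta_3, \theta_4\}$ and another third source of evidence with the following basic belief assignment

$$m_3(\theta_3) = 0.5 \quad m_3(\theta_4) = 0.3 \quad m_3(\theta_3 \cap \theta_4) = 0.1 \quad m_3(\theta_3 \cup \theta_4) = 0.1$$

Then the DSm rule applied at time $t_{l+1}$ provides the following combined belief assignment

$$m_{123}(\theta_1 \cap \theta_3) = 0.105 \quad m_{123}(\theta_1 \cap \theta_4) = 0.063 \quad m_{123}(\theta_1 \cap (\theta_3 \cup \theta_4)) = 0.021 \quad m_{123}(\theta_1 \cap \theta_3 \cap \theta_4) = 0.021$$
$$m_{123}(\theta_2 \cap \theta_3) = 0.085 \quad m_{123}(\theta_2 \cap \theta_4) = 0.051 \quad m_{123}(\theta_2 \cap (\theta_3 \cup \theta_4)) = 0.017 \quad m_{123}(\theta_2 \cap \theta_3 \cap \theta_4) = 0.017$$
$$m_{123}(\theta_3 \cap (\theta_1 \cup \theta_2)) = 0.015 \quad m_{123}(\theta_4 \cap (\theta_1 \cup \theta_2)) = 0.009 \quad m_{123}((\theta_1 \cup \theta_2) \cap (\theta_3 \cup \theta_4)) = 0.003$$
$$m_{123}((\theta_1 \cup \theta_2) \cap (\theta_3 \cap \theta_4)) = 0.003 \quad m_{123}(\theta_1 \cap \theta_2 \cap \theta_3) = 0.295 \quad m_{123}(\theta_1 \cap \theta_2 \cap \theta_4) = 0.177$$
$$m_{123}((\theta_1 \cap \theta_2) \cap (\theta_3 \cup \theta_4)) = 0.059 \quad m_{123}(\theta_1 \cap \theta_2 \cap \theta_3 \cap \theta_4) = 0.059$$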
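Now suppose at time $t_{l+2}$ one finds out that $\theta_3 = \theta_4 = \emptyset$; then one applies the hybrid DSm rule after re-adjusting the combined belief mass $m_{123}(.)$ by cumulating the masses of all empty sets. Using the hybrid DSm rule, one finally gets:

$$m_{t_{l+2}}(\theta_1) = m_{123}(\theta_1) + \{m_{12}(\theta_1)m_3(\theta_3) + m_{12}(\theta_1)m_3(\theta_4) + m_{12}(\theta_1)m_3(\theta_3 \cup \theta_4) + m_{12}(\theta_1)m_3(\theta_3 \cap \theta_4)\}$$
$$= 0 + \{(0.21 \times 0.5) + (0.21 \times 0.3) + (0.21 \times 0.1) + (0.21 \times 0.1)\} = 0.21$$

$$m_{t_{l+2}}(\theta_2) = m_{123}(\theta_2) + \{m_{12}(\theta_2)m_3(\theta_3) + m_{12}(\theta_2)m_3(\theta_4) + m_{12}(\theta_2)m_3(\theta_3 \cup \theta_4) + m_{12}(\theta_2)m_3(\theta_3 \cap \theta_4)\}$$
$$= 0 + \{(0.17 \times 0.5) + (0.17 \times 0.3) + (0.17 \times 0.1) + (0.17 \times 0.1)\} = 0.17$$

$$m_{t_{l+2}}(\theta_1 \cup \theta_2) = m_{123}(\theta_1 \cup \theta_2) + \{m_{12}(\theta_1 \cup \theta_2)m_3(\theta_3) + m_{12}(\theta_1 \cup \theta_2)m_3(\theta_4)$$
$$+ m_{12}(\theta_1 \cup \theta_2)m_3(\theta_3 \cup \theta_4) + m_{12}(\theta_1 \cup \theta_2)m_3(\theta_3 \cap \theta_4)\}$$
$$+ \sum_{X_1, X_2 \in \{\theta_3, \theta_4, \theta_3 \cup \theta_4, \theta_3 \cap \theta_4\}} m_{12}(X_1)m_3(X_2)$$
$$= 0 + \{(0.03 \times 0.5) + (0.03 \times 0.3) + (0.03 \times 0.1) + (0.03 \times 0.1)\} + \{0\} = 0.03$$

$$m_{t_{l+2}}(\theta_1 \cap \theta_2) = m_{123}(\theta_1 \cap \theta_2) + \{m_{12}(\theta_1 \cap \theta_2)m_3(\theta_3) + m_{12}(\theta_1 \cap \theta_2)m_3(\theta_4)$$
$$+ m_{12}(\theta_1 \cap \theta_2)m_3(\theta_3 \cup \theta_4) + m_{12}(\theta_1 \cap \theta_2)m_3(\theta_3 \cap \theta_4)\}$$
$$= 0 + \{(0.59 \times 0.5) + (0.59 \times 0.3) + (0.59 \times 0.1) + (0.59 \times 0.1)\} = 0.59$$

Thus we get the same result as for $m_{12}(.)$ at time $t_l$, which is normal.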


Remark: note that if the third source of information doesn’t assign non-null masses to $\theta_1$ or $\theta_2$ (or to their combinations using the $\cup$ or $\cap$ operators), then one obtains the same result at time $t_{l+2}$ as at time $t_l$, as in this example 3.3, i.e. $m_{l+2}(.) = m_l(.)$, when imposing $\theta_3 = \theta_4 = \emptyset$ again. But if the third source of information assigns non-null masses to either $\theta_1$ or $\theta_2$, or to some of their combinations $\theta_1 \cup \theta_2$ or $\theta_1 \cap \theta_2$, then when one returns from 4 singletons to 2 singletons for $\Theta$, setting $\theta_3 = \theta_4 = \emptyset$ and using the hybrid DSm rule, the fusion result at time $t_{l+2}$ is different from that at time $t_l$. This is normal because some information/mass is left from the third source and is now fused with that of the previous sources (as in example 3.2 or in the next example 3.4).


In general, let's suppose that the fusion of $k \ge 2$ masses provided by the sources $B_1, B_2, \ldots, B_k$ has been done at time $t_l$ on $\Theta(t_l) = \{\theta_1, \theta_2, \ldots, \theta_n\}$. At time $t_{l+1}$ new non-empty elements $\theta_{n+1}, \theta_{n+2}, \ldots, \theta_{n+m}$ appear, $m \ge 1$, thus $\Theta(t_{l+1}) = \{\theta_1, \theta_2, \ldots, \theta_n, \theta_{n+1}, \theta_{n+2}, \ldots, \theta_{n+m}\}$, and of course one or more sources (i.e. bodies of evidence) $B_{k+1}, \ldots, B_{k+l}$, where $l \ge 1$, appear to assign masses to these new elements.

a) If all these new sources $B_{k+1}, \ldots, B_{k+l}$ assign null masses to all elements of $D^{\Theta(t_{l+1})}$ which contain in their structure/composition at least one of the singletons $\theta_1, \theta_2, \ldots, \theta_n$, then at time $t_{l+2}$, if one sets back the constraints $\theta_{n+1} = \theta_{n+2} = \ldots = \theta_{n+m} = \emptyset$, then using the hybrid DSm rule one obtains the same result as at time $t_l$, i.e. $m_{l+2}(.) = m_l(.)$.

b) Otherwise, the fusion at time $t_{l+2}$ will be different from the fusion at time $t_l$ because some information/mass from the sources $B_{k+1}, \ldots, B_{k+l}$ still remains on the singletons $\theta_1, \theta_2, \ldots, \theta_n$ or on some elements of $D^{\Theta(t_l)}$ which contain at least one such singleton, information/mass which is fused with that of the previous sources.


4.6.3.4 Example 3.4 
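Let's consider $\Theta(t_l) \triangleq \{\theta_1, \theta_2\}$ and the two following basic belief assignments available at time $t_l$:

$$m_1(\theta_1) = 0.6 \quad m_1(\theta_2) = 0.4 \qquad \text{and} \qquad m_2(\theta_1) = 0.7 \quad m_2(\theta_2) = 0.3$$

The classical DSm rule of combination gives $m_{12}(\theta_1) = 0.42$, $m_{12}(\theta_2) = 0.12$ and $m_{12}(\theta_1 \cap \theta_2) = 0.46$. Now let's consider at time $t_{l+1}$ the frame $\Theta(t_{l+1}) \triangleq \{\theta_1, \theta_2, \theta_3\}$ and a third source of evidence with the following basic belief assignment: $m_3(\theta_1) = 0.5$, $m_3(\theta_2) = 0.2$ and $m_3(\theta_3) = 0.3$. Then the final result obtained from the classical DSm rule of combination is

$$m_{123}(\theta_1) = 0.210 \quad m_{123}(\theta_2) = 0.024 \quad m_{123}(\theta_1 \cap \theta_2) = 0.466 \quad m_{123}(\theta_1 \cap \theta_3) = 0.126$$
$$m_{123}(\theta_2 \cap \theta_3) = 0.036 \quad m_{123}(\theta_1 \cap \theta_2 \cap \theta_3) = 0.138$$

Suppose now one finds out that the integrity constraint $\theta_1 \cap \theta_3 = \emptyset$ holds, which also implies $\theta_1 \cap \theta_2 \cap \theta_3 = \emptyset$. This is the hybrid DSm model $\mathcal{M}$ under consideration. By applying the hybrid DSm fusion rule, one forces $m_{\mathcal{M}}(\theta_1 \cap \theta_3) = 0$ and $m_{\mathcal{M}}(\theta_1 \cap \theta_2 \cap \theta_3) = 0$; the mass $m_{123}(\theta_1 \cap \theta_2 \cap \theta_3) = 0.138$ is transferred towards $m_{\mathcal{M}}((\theta_1 \cap \theta_2) \cup \theta_3)$ and the mass $m_{123}(\theta_1 \cap \theta_3) = 0.126$ has to be transferred towards $m_{\mathcal{M}}(\theta_1 \cup \theta_3)$. One then finally gets

$$m_{\mathcal{M}}(\theta_1) = 0.210 \quad m_{\mathcal{M}}(\theta_2) = 0.024 \quad m_{\mathcal{M}}(\theta_1 \cap \theta_2) = 0.466 \quad m_{\mathcal{M}}(\theta_2 \cap \theta_3) = 0.036$$
$$m_{\mathcal{M}}((\theta_1 \cap \theta_2) \cup \theta_3) = 0.138 \quad m_{\mathcal{M}}(\theta_1 \cup \theta_3) = 0.126$$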


4.6.3.5 Example 3.5 


Let's consider $\Theta(t_l) \triangleq \{\theta_1, \theta_2\}$ and the two previous basic belief assignments available at time $t_l$ as in the previous example, i.e.

$$m_1(\theta_1) = 0.6 \quad m_1(\theta_2) = 0.4 \qquad \text{and} \qquad m_2(\theta_1) = 0.7 \quad m_2(\theta_2) = 0.3$$

The classical DSm rule of combination gives

$$m_{12}(\theta_1) = 0.42 \quad m_{12}(\theta_2) = 0.12 \quad m_{12}(\theta_1 \cap \theta_2) = 0.46$$

Now let's consider at time $t_{l+1}$ the frame $\Theta(t_{l+1}) \triangleq \{\theta_1, \theta_2, \theta_3\}$ and a third source of evidence with the following basic belief assignment

$$m_3(\theta_1) = 0.5 \quad m_3(\theta_2) = 0.2 \quad m_3(\theta_3) = 0.3$$

Then the final result of the fusion is obtained by combining $m_3(.)$ with $m_{12}(.)$ by the classical DSm rule of combination. One thus obtains

$$m_{123}(\theta_1) = 0.210 \quad m_{123}(\theta_2) = 0.024 \quad m_{123}(\theta_1 \cap \theta_2) = 0.466 \quad m_{123}(\theta_1 \cap \theta_3) = 0.126$$
$$m_{123}(\theta_2 \cap \theta_3) = 0.036 \quad m_{123}(\theta_1 \cap \theta_2 \cap \theta_3) = 0.138$$

But suppose one finds out that the integrity constraint is now $\theta_3 = \emptyset$, which necessarily also implies $\theta_1 \cap \theta_3 = \theta_2 \cap \theta_3 = \theta_1 \cap \theta_2 \cap \theta_3 = \emptyset$ and $(\theta_1 \cup \theta_2) \cap \theta_3 = \emptyset$ (this is our new hybrid DSm model $\mathcal{M}$ under consideration in this example). By applying the hybrid DSm fusion rule, one finally gets the non-null masses

$$m_{\mathcal{M}}(\theta_1) = 0.336 \quad m_{\mathcal{M}}(\theta_2) = 0.060 \quad m_{\mathcal{M}}(\theta_1 \cap \theta_2) = 0.604$$
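The three values of Example 3.5 can be traced term by term. The following worked arithmetic is a sketch that assumes the hybrid rule is applied to the pair $(m_{12}, m_3)$: products whose intersection is non-empty in the model keep their classical destination, while a product involving $m_3(\theta_3)$ is sent to the union of its two operands, which reduces to the first operand once $\theta_3 = \emptyset$.

```latex
\begin{align*}
m_{\mathcal{M}}(\theta_1) &= m_{12}(\theta_1)m_3(\theta_1) + m_{12}(\theta_1)m_3(\theta_3)
  = 0.42 \times 0.5 + 0.42 \times 0.3 = 0.210 + 0.126 = 0.336 \\
m_{\mathcal{M}}(\theta_2) &= m_{12}(\theta_2)m_3(\theta_2) + m_{12}(\theta_2)m_3(\theta_3)
  = 0.12 \times 0.2 + 0.12 \times 0.3 = 0.024 + 0.036 = 0.060 \\
m_{\mathcal{M}}(\theta_1 \cap \theta_2) &= m_{123}(\theta_1 \cap \theta_2) + m_{12}(\theta_1 \cap \theta_2)m_3(\theta_3)
  = 0.466 + 0.46 \times 0.3 = 0.466 + 0.138 = 0.604
\end{align*}
```

The three masses sum to 1, so no normalization step is needed.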


4.6.3.6 Example 3.6 
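Let's consider $\Theta(t_l) \triangleq \{\theta_1, \theta_2, \theta_3, \theta_4\}$ and the following basic belief assignments available at time $t_l$:

$$m_1(\theta_1) = 0.5 \quad m_1(\theta_2) = 0.4 \quad m_1(\theta_1 \cap \theta_2) = 0.1$$
$$m_2(\theta_1) = 0.3 \quad m_2(\theta_2) = 0.2 \quad m_2(\theta_1 \cap \theta_3) = 0.1 \quad m_2(\theta_4) = 0.4$$

The classical DSm rule of combination gives

$$m_{12}(\theta_1) = 0.15 \quad m_{12}(\theta_2) = 0.08 \quad m_{12}(\theta_1 \cap \theta_2) = 0.27 \quad m_{12}(\theta_1 \cap \theta_3) = 0.05 \quad m_{12}(\theta_1 \cap \theta_4) = 0.20$$
$$m_{12}(\theta_2 \cap \theta_4) = 0.16 \quad m_{12}(\theta_1 \cap \theta_2 \cap \theta_3) = 0.05 \quad m_{12}(\theta_1 \cap \theta_2 \cap \theta_4) = 0.04$$

Now assume that at time $t_{l+1}$ one finds out that $\theta_1 \cap \theta_2 \overset{\mathcal{M}}{\equiv} \theta_1 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$. Using the hybrid DSm rule, one gets:

$$m_{\mathcal{M}}(\theta_1 \cap \theta_2) = m_{\mathcal{M}}(\theta_1 \cap \theta_3) = m_{\mathcal{M}}(\theta_1 \cap \theta_2 \cap \theta_3) = m_{\mathcal{M}}(\theta_1 \cap \theta_2 \cap \theta_4) = 0$$
$$m_{\mathcal{M}}(\theta_1) = m_{12}(\theta_1) + m_2(\theta_1)m_1(\theta_1 \cap \theta_2) + m_1(\theta_1)m_2(\theta_1 \cap \theta_3) = 0.15 + 0.03 + 0.05 = 0.23$$
$$m_{\mathcal{M}}(\theta_2) = m_{12}(\theta_2) + m_2(\theta_2)m_1(\theta_1 \cap \theta_2) + m_1(\theta_2)m_2(\theta_1 \cap \theta_3) = 0.08 + 0.02 + 0.04 = 0.14$$
$$m_{\mathcal{M}}(\theta_4) = m_{12}(\theta_4) + m_1(\theta_1 \cap \theta_2)m_2(\theta_4) = 0 + 0.04 = 0.04$$
$$m_{\mathcal{M}}(\theta_1 \cap \theta_4) = m_{12}(\theta_1 \cap \theta_4) = 0.20$$
$$m_{\mathcal{M}}(\theta_2 \cap \theta_4) = m_{12}(\theta_2 \cap \theta_4) = 0.16$$
$$m_{\mathcal{M}}(\theta_1 \cup \theta_2) = m_{12}(\theta_1 \cup \theta_2) + m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2) + m_1(\theta_1 \cap \theta_2)m_2(\theta_1 \cap \theta_2) = 0.22$$
$$m_{\mathcal{M}}(\theta_1 \cup \theta_2 \cup \theta_3) = m_{12}(\theta_1 \cup \theta_2 \cup \theta_3) + m_1(\theta_1 \cap \theta_2)m_2(\theta_1 \cap \theta_3) + m_2(\theta_1 \cap \theta_2)m_1(\theta_1 \cap \theta_3)$$
$$+ m_1(\theta_1 \cap \theta_2 \cap \theta_3)m_2(\theta_1 \cap \theta_2 \cap \theta_3) = 0.01$$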
4.6.3.7 Example 3.7


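Let's consider $\Theta(t_l) \triangleq \{\theta_1, \theta_2, \theta_3, \theta_4\}$ and the following basic belief assignments available at time $t_l$:

$$m_1(\theta_1) = 0.2 \quad m_1(\theta_2) = 0.4 \quad m_1(\theta_1 \cap \theta_2) = 0.1 \quad m_1(\theta_1 \cap \theta_3) = 0.2 \quad m_1(\theta_4) = 0.1$$
$$m_2(\theta_1) = 0.1 \quad m_2(\theta_2) = 0.3 \quad m_2(\theta_1 \cap \theta_2) = 0.2 \quad m_2(\theta_1 \cap \theta_3) = 0.1 \quad m_2(\theta_4) = 0.3$$

The classical DSm rule of combination gives

$$m_{12}(\theta_1) = 0.02 \quad m_{12}(\theta_2) = 0.12 \quad m_{12}(\theta_1 \cap \theta_2) = 0.28 \quad m_{12}(\theta_1 \cap \theta_3) = 0.06 \quad m_{12}(\theta_4) = 0.03$$
$$m_{12}(\theta_1 \cap \theta_4) = 0.07 \quad m_{12}(\theta_2 \cap \theta_4) = 0.15 \quad m_{12}(\theta_1 \cap \theta_2 \cap \theta_3) = 0.15$$
$$m_{12}(\theta_1 \cap \theta_2 \cap \theta_4) = 0.05 \quad m_{12}(\theta_1 \cap \theta_3 \cap \theta_4) = 0.07$$

Now assume that at time $t_{l+1}$ one finds out that $\theta_1 \cap \theta_2 \overset{\mathcal{M}}{\equiv} \theta_1 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$. Using the hybrid DSm rule, one gets:

$$m_{\mathcal{M}}(\theta_1 \cap \theta_2) = m_{\mathcal{M}}(\theta_1 \cap \theta_3) = m_{\mathcal{M}}(\theta_1 \cap \theta_2 \cap \theta_3) = m_{\mathcal{M}}(\theta_1 \cap \theta_2 \cap \theta_4) = 0$$
$$m_{\mathcal{M}}(\theta_1) = m_{12}(\theta_1) + m_1(\theta_1)m_2(\theta_1 \cap \theta_2) + m_2(\theta_1)m_1(\theta_1 \cap \theta_2) + m_1(\theta_1)m_2(\theta_1 \cap \theta_3) + m_2(\theta_1)m_1(\theta_1 \cap \theta_3) = 0.11$$
$$m_{\mathcal{M}}(\theta_2) = m_{12}(\theta_2) + m_1(\theta_2)m_2(\theta_1 \cap \theta_2) + m_2(\theta_2)m_1(\theta_1 \cap \theta_2) + m_1(\theta_2)m_2(\theta_1 \cap \theta_3) + m_2(\theta_2)m_1(\theta_1 \cap \theta_3) = 0.33$$
$$m_{\mathcal{M}}(\theta_4) = m_{12}(\theta_4) + m_1(\theta_4)m_2(\theta_1 \cap \theta_2) + m_2(\theta_4)m_1(\theta_1 \cap \theta_2) + m_1(\theta_4)m_2(\theta_1 \cap \theta_3) + m_2(\theta_4)m_1(\theta_1 \cap \theta_3) = 0.15$$
$$m_{\mathcal{M}}(\theta_1 \cap \theta_4) = m_{12}(\theta_1 \cap \theta_4) = 0.07$$
$$m_{\mathcal{M}}(\theta_2 \cap \theta_4) = m_{12}(\theta_2 \cap \theta_4) = 0.15$$
$$m_{\mathcal{M}}(\theta_1 \cup \theta_2) = m_{12}(\theta_1 \cup \theta_2) + m_1(\theta_1 \cap \theta_2)m_2(\theta_1 \cap \theta_2) + m_1(\theta_1)m_2(\theta_2) + m_2(\theta_1)m_1(\theta_2) = 0.12$$
$$m_{\mathcal{M}}(\theta_1 \cup \theta_3) = m_{12}(\theta_1 \cup \theta_3) + m_1(\theta_1 \cap \theta_3)m_2(\theta_1 \cap \theta_3) = 0.02$$
$$m_{\mathcal{M}}(\theta_1 \cup \theta_2 \cup \theta_3) = m_{12}(\theta_1 \cup \theta_2 \cup \theta_3) + m_1(\theta_1 \cap \theta_2)m_2(\theta_1 \cap \theta_3) + m_2(\theta_1 \cap \theta_2)m_1(\theta_1 \cap \theta_3) = 0.05$$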


4.7 Bayesian mixture of hybrid DSm models 


In the preceding sections, we first showed how to combine generalized basic belief assignments provided by $k \ge 2$ independent and equally reliable sources of information with the general hybrid DSm rule of combination, which deals with all possible kinds of integrity constraints involved in a model. This approach implicitly assumes that one knows/trusts with certainty that the model $\mathcal{M}$ (usually a hybrid DSm model) of the problem is valid and corresponds to the true model. In some complex fusion problems however (static or dynamic ones), one may have some doubts about the validity of the model $\mathcal{M}$ on which the fusion is based, because of the nature and evolution of the elements of the frame $\Theta$. In such situations, we propose to consider a set of exclusive and exhaustive models $\{\mathcal{M}_1, \mathcal{M}_2, \ldots, \mathcal{M}_K\}$ with some probabilities $\{P\{\mathcal{M}_1\}, P\{\mathcal{M}_2\}, \ldots, P\{\mathcal{M}_K\}\}$. We do not go deeper here into the justification/acquisition of such probabilities because this is highly dependent on the nature of the fusion problem under consideration. We just assume here that such probabilities are available at any given time $t_l$ when the fusion has to be done. We then propose to use the Bayesian mixture of the combined masses $m_{\mathcal{M}_i(\Theta)}(.)$, $i = 1, \ldots, K$, to obtain the final result:

$$\forall A \in D^\Theta, \qquad m_{\mathcal{M}_1, \ldots, \mathcal{M}_K}(A) = \sum_{i=1}^{K} P\{\mathcal{M}_i\}\, m_{\mathcal{M}_i(\Theta)}(A) \qquad (4.14)$$
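Equation (4.14) is a plain weighted sum, which the following Python sketch makes concrete. It is an illustration only: the fused masses are the hybrid results of Examples 3.4 and 3.5, treated here (purely for the sake of the example) as the only two candidate models, and the probabilities 0.7 and 0.3 are made up. The function name `bayesian_mixture` and the string labels are hypothetical.

```python
from collections import defaultdict

def bayesian_mixture(model_probs, fused_masses):
    """Weighted sum of per-model fused masses, as in equation (4.14)."""
    if abs(sum(model_probs) - 1.0) > 1e-9:
        raise ValueError("model probabilities must sum to 1")
    out = defaultdict(float)
    for p, masses in zip(model_probs, fused_masses):
        for element, w in masses.items():
            out[element] += p * w
    return dict(out)

# Fused masses under two candidate models (values from Examples 3.4 and 3.5).
m_model_a = {"t1": 0.210, "t2": 0.024, "t1&t2": 0.466, "t2&t3": 0.036,
             "(t1&t2)|t3": 0.138, "t1|t3": 0.126}
m_model_b = {"t1": 0.336, "t2": 0.060, "t1&t2": 0.604}

# The probabilities 0.7 / 0.3 are invented for illustration.
mixed = bayesian_mixture([0.7, 0.3], [m_model_a, m_model_b])
for element, w in sorted(mixed.items()):
    print(element, round(w, 4))
```

Since each input is a normalized mass function and the weights sum to one, the mixture is again normalized without any extra step.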
4.8 Conclusion 


In this chapter we have extended the DSmT and the classical DSm rule of combination to the case of any kind of hybrid model for the frame $\Theta$ involved in many complex fusion problems. The free-DSm model (which assumes that none of the elements of the frame is refinable) can be interpreted as the opposite of Shafer's model (which assumes that all elements of the frame are truly exclusive), on which the mathematical theory of evidence (Dempster-Shafer Theory, DST) is based. Between these two extreme models, there actually exist many possible hybrid models for the frame $\Theta$, depending on the real intrinsic nature of the elements of the fusion problem under consideration. For real problems, some elements of $\Theta$ can appear to be truly exclusive whereas others cannot be considered fully discernible or refinable. This research work proposes a new hybrid DSm rule of combination for hybrid models based on the DSmT. The hybrid DSm rule works in any model and applies to the fusion of masses from any number of sources of information, no matter how big the conflict/paradoxism of the sources is, and on any frame (exhaustive or non-exhaustive, with elements which may be exclusive or non-exclusive or both). This is an important rule since it requires neither the calculation of weighting factors nor normalization, as other rules do, and the transfer of the masses of empty sets to the masses of non-empty sets is naturally done following the DSm network architecture, which is derived from the classic DSm rule. DSmT together with the hybrid DSm rule is a new solid alternative to classical approaches and to existing combination rules. This new result is appealing for the development of future complex (uncertain/incomplete/paradoxical/dynamical) information fusion systems.
Abstract: This chapter presents several classes of fusion problems which cannot be directly approached by the classical mathematical theory of evidence, also known as Dempster-Shafer Theory (DST), either because Shafer’s model for the frame of discernment is impossible to obtain, or just because Dempster’s rule of combination fails to provide coherent results (or no result at all). We present and discuss the potential of the DSmT combined with its classical (or hybrid) rule of combination to attack these infinite classes of fusion problems.
5.1 Introduction 


In this chapter we focus our attention on the limits of the validity of Dempster’s rule of combination in the Dempster-Shafer theory (DST) [5]. We provide several infinite classes of fusion problems where Dempster’s rule of combination fails to provide coherent results, and we show how these problems can be attacked directly by the DSmT presented in the previous chapters. DST and DSmT are based on a different approach for modelling the frame $\Theta$ of the problem (Shafer’s model versus the free-DSm or hybrid-DSm model), on the choice of the space (classical power set $2^\Theta$ versus hyper-power set $D^\Theta$) on which the basic belief assignment functions $m_i(.)$ to be combined are defined, and on the fusion rules to apply (Dempster’s rule versus the DSm rule or hybrid DSm rule of combination).


5.2 First infinite class of counter examples 


The first infinite class of counter examples for Dempster’s rule of combination consists trivially in all cases for which Dempster’s rule becomes mathematically undefined, i.e. one has 0/0, because of fully conflicting sources. The first sub-class, presented in subsection 5.2.1, corresponds to Bayesian belief functions. Subsection 5.2.2 presents counter-examples for more general conflicting sources of evidence.


5.2.1 Counter-examples for Bayesian sources 


The following examples are devoted only to Bayesian sources, i.e. sources for which the focal elements of belief functions coincide only with some singletons $\theta_i$ of $\Theta$.


5.2.1.1 Example with $\Theta = \{\theta_1, \theta_2\}$


Let's consider the frame of discernment $\Theta = \{\theta_1, \theta_2\}$, two independent experts, and the basic belief masses:

$$m_1(\theta_1) = 1 \qquad m_1(\theta_2) = 0$$
$$m_2(\theta_1) = 0 \qquad m_2(\theta_2) = 1$$

We represent these belief assignments by the mass matrix

$$\mathbf{M} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$$

• Dempster's rule cannot be applied because one formally gets $m(\theta_1) = 0/0$ and $m(\theta_2) = 0/0$ as well, i.e. undefined.

• The DSm rule works here because one obtains $m(\theta_1) = m(\theta_2) = 0$ and $m(\theta_1 \cap \theta_2) = 1$ (the total paradox, which it really is if one accepts the free-DSm model). If one adopts Shafer’s model and applies the hybrid DSm rule, then one gets $m_h(\theta_1 \cup \theta_2) = 1$, which makes sense in this case. The index $h$ denotes here the mass obtained with the hybrid DSm rule, to avoid confusion with the result obtained with the classic DSm rule.
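The 0/0 situation is easy to reproduce numerically. For Bayesian sources, Dempster's rule reduces to multiplying the columns of the mass matrix and normalizing, so the sketch below simply returns `None` when the normalization constant vanishes. The function name `dempster_bayesian` is hypothetical, and the second, low-conflict matrix is invented only to show a case where the rule is defined.

```python
def dempster_bayesian(mass_matrix):
    """Dempster's rule for Bayesian sources: normalised column products.

    Row i holds the masses source i gives to theta_1..theta_n.
    Returns None when the normalisation would be 0/0 (total conflict).
    """
    n = len(mass_matrix[0])
    agreement = []
    for j in range(n):
        prod = 1.0
        for row in mass_matrix:
            prod *= row[j]
        agreement.append(prod)
    total = sum(agreement)     # equals 1 - (degree of conflict)
    if total == 0.0:
        return None
    return [a / total for a in agreement]

print(dempster_bayesian([[1, 0], [0, 1]]))          # total conflict: prints None
print(dempster_bayesian([[0.9, 0.1], [0.2, 0.8]]))  # made-up matrix where the rule is defined
```

Whenever every column of the matrix contains at least one zero, every column product is zero and the function returns `None`.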


5.2.1.2 Example with $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$


Let's consider the frame of discernment $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$, two independent experts, and the mass matrix

$$\begin{bmatrix} 0.6 & 0 & 0.4 & 0 \\ 0 & 0.2 & 0 & 0.8 \end{bmatrix}$$

• Again, Dempster’s rule cannot be applied because, $\forall\, 1 \le j \le 4$, one gets $m(\theta_j) = 0/0$ (undefined!).

• But the DSm rule works because one obtains $m(\theta_1) = m(\theta_2) = m(\theta_3) = m(\theta_4) = 0$, and $m(\theta_1 \cap \theta_2) = 0.12$, $m(\theta_1 \cap \theta_4) = 0.48$, $m(\theta_2 \cap \theta_3) = 0.08$, $m(\theta_3 \cap \theta_4) = 0.32$ (partial paradoxes/conflicts).

• Suppose now one finds out that all intersections are empty (Shafer's model); then one applies the hybrid DSm rule and one gets (the index $h$ stands here for the hybrid rule): $m_h(\theta_1 \cup \theta_2) = 0.12$, $m_h(\theta_1 \cup \theta_4) = 0.48$, $m_h(\theta_2 \cup \theta_3) = 0.08$ and $m_h(\theta_3 \cup \theta_4) = 0.32$.


5.2.1.3 Another example with $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$


Let's consider the frame of discernment $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$, three independent experts, and the mass matrix

$$\begin{bmatrix} 0.6 & 0 & 0.4 & 0 \\ 0 & 0.2 & 0 & 0.8 \\ 0 & 0.3 & 0 & 0.7 \end{bmatrix}$$

• Again, Dempster’s rule cannot be applied because, $\forall\, 1 \le j \le 4$, one gets $m(\theta_j) = 0/0$ (undefined!).

• But the DSm rule works because one obtains $m(\theta_1) = m(\theta_2) = m(\theta_3) = m(\theta_4) = 0$, and

$$m(\theta_1 \cap \theta_2) = 0.6 \cdot 0.2 \cdot 0.3 = 0.036$$
$$m(\theta_1 \cap \theta_4) = 0.6 \cdot 0.8 \cdot 0.7 = 0.336$$
$$m(\theta_2 \cap \theta_3) = 0.4 \cdot 0.2 \cdot 0.3 = 0.024$$
$$m(\theta_3 \cap \theta_4) = 0.4 \cdot 0.8 \cdot 0.7 = 0.224$$
$$m(\theta_1 \cap \theta_2 \cap \theta_4) = 0.6 \cdot 0.2 \cdot 0.7 + 0.6 \cdot 0.3 \cdot 0.8 = 0.228$$
$$m(\theta_2 \cap \theta_3 \cap \theta_4) = 0.2 \cdot 0.4 \cdot 0.7 + 0.3 \cdot 0.4 \cdot 0.8 = 0.152$$

(partial paradoxes/conflicts), and the others equal zero. If we add all these masses, the sum equals 1.

• Suppose now one finds out that all intersections are empty (Shafer’s model); then one applies the hybrid DSm rule and one gets: $m_h(\theta_1 \cup \theta_2) = 0.036$, $m_h(\theta_1 \cup \theta_4) = 0.336$, $m_h(\theta_2 \cup \theta_3) = 0.024$, $m_h(\theta_3 \cup \theta_4) = 0.224$, $m_h(\theta_1 \cup \theta_2 \cup \theta_4) = 0.228$, $m_h(\theta_2 \cup \theta_3 \cup \theta_4) = 0.152$.


5.2.1.4 More general 


Let's consider the frame of discernment $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$, with $n \ge 2$, and $k$ experts, for $k \ge 2$. Let $\mathbf{M} = [a_{ij}]$, $1 \le i \le k$, $1 \le j \le n$, be the mass matrix with $k$ rows and $n$ columns. If each column of the mass matrix contains at least one zero, then Dempster’s rule cannot be applied because one obtains, for all $1 \le j \le n$, $m(\theta_j) = 0/0$, which is undefined! The degree of conflict is 1. However, one can use the classical DSm rule and one obtains: for all $1 \le j \le n$, $m(\theta_j) = 0$, and also partial paradoxes/conflicts:

$$\forall\, 1 \le v_s \le n,\ 1 \le s \le w,\ \text{and } 2 \le w \le k, \qquad m(\theta_{v_1} \cap \theta_{v_2} \cap \ldots \cap \theta_{v_w}) = \sum (a_{1t_1}) \cdot (a_{2t_2}) \cdot \ldots \cdot (a_{kt_k}),$$

where the set $T = \{t_1, t_2, \ldots, t_k\}$ is equal to the set $V = \{v_1, v_2, \ldots, v_w\}$, but the order may be different and the elements in the set $T$ could be repeated; we mean that from the set $V$ one obtains the set $T$ if one repeats some elements of $V$. Therefore, the summation $\sum$ is done over all possible combinations of elements from columns $v_1, v_2, \ldots, v_w$ such that one takes at least one element from each of these columns $v_1, v_2, \ldots, v_w$ and also such that one takes one element only from each row; the product $(a_{1t_1}) \cdot (a_{2t_2}) \cdot \ldots \cdot (a_{kt_k})$ contains one element only from each row $1, 2, \ldots, k$ respectively, and one or more elements from each of the columns $v_1, v_2, \ldots, v_w$ respectively.
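The summation described above can be sketched in a few lines of Python: pick one column per row, multiply the picked entries, and add the product to the intersection of the picked singletons. This is an illustrative sketch (the name `dsm_classic_bayesian` is hypothetical); the matrix used is the three-expert one of section 5.2.1.3, with $m(\theta_1 \cap \theta_2) = 0.036$ and $m(\theta_1 \cap \theta_2 \cap \theta_4) = 0.228$ as reference values.

```python
from itertools import product
from collections import defaultdict

def dsm_classic_bayesian(mass_matrix):
    """Classical DSm rule for k Bayesian sources given as a k x n mass matrix.

    Picks one column per row; the product of the picked entries is added to
    the intersection of the picked singletons (keyed by their column indices).
    """
    n = len(mass_matrix[0])
    out = defaultdict(float)
    for cols in product(range(n), repeat=len(mass_matrix)):
        w = 1.0
        for row, j in zip(mass_matrix, cols):
            w *= row[j]
        if w:
            out[frozenset(j + 1 for j in cols)] += w   # 1-based theta indices
    return dict(out)

M = [[0.6, 0.0, 0.4, 0.0],
     [0.0, 0.2, 0.0, 0.8],
     [0.0, 0.3, 0.0, 0.7]]
m = dsm_classic_bayesian(M)
for cols, w in sorted(m.items(), key=lambda kv: sorted(kv[0])):
    print(" ∩ ".join(f"theta_{j}" for j in sorted(cols)), round(w, 3))
```

A key such as `frozenset({1, 2, 4})` stands for $\theta_1 \cap \theta_2 \cap \theta_4$ in the free model; under Shafer's model the same mass is read as belonging to the union $\theta_1 \cup \theta_2 \cup \theta_4$, as in the hybrid results listed in section 5.2.1.3.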


5.2.2 Counter-examples for more general sources 


We present in this section two numerical examples involving general (i.e. non Bayesian) sources where 


Dempster’s rule cannot be applied. 


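5.2.2.1 Example with $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$


Let's consider $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$, two independent experts, and the mass matrix:

$$\begin{array}{c|ccccc} & \theta_1 & \theta_2 & \theta_3 & \theta_4 & \theta_1 \cup \theta_2 \\ \hline m_1(.) & 0.4 & 0.5 & 0 & 0 & 0.1 \\ m_2(.) & 0 & 0 & 0.3 & 0.7 & 0 \end{array}$$

Dempster’s rule cannot apply here because one gets 0/0 for all $m(\theta_i)$, $1 \le i \le 4$, but the DSm rules (classical or hybrid) work.

Using the DSm classical rule: $m(\theta_1 \cap \theta_3) = 0.12$, $m(\theta_1 \cap \theta_4) = 0.28$, $m(\theta_2 \cap \theta_3) = 0.15$, $m(\theta_2 \cap \theta_4) = 0.35$, $m(\theta_3 \cap (\theta_1 \cup \theta_2)) = 0.03$, $m(\theta_4 \cap (\theta_1 \cup \theta_2)) = 0.07$.

Suppose now one finds out that one has a Shafer model; then one uses the hybrid DSm rule (denoted here with index $h$): $m_h(\theta_1 \cup \theta_3) = 0.12$, $m_h(\theta_1 \cup \theta_4) = 0.28$, $m_h(\theta_2 \cup \theta_3) = 0.15$, $m_h(\theta_2 \cup \theta_4) = 0.35$, $m_h(\theta_3 \cup \theta_1 \cup \theta_2) = 0.03$, $m_h(\theta_4 \cup \theta_1 \cup \theta_2) = 0.07$.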

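5.2.2.2 Another example with $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$


Let's consider $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$, three independent experts, and the mass matrix:

$$\begin{array}{c|cccccc} & \theta_1 & \theta_2 & \theta_3 & \theta_4 & \theta_1 \cup \theta_2 & \theta_3 \cup \theta_4 \\ \hline m_1(.) & 0.4 & 0.5 & 0 & 0 & 0.1 & 0 \\ m_2(.) & 0 & 0 & 0.3 & 0.6 & 0 & 0.1 \\ m_3(.) & 0.8 & 0 & 0 & 0 & 0.2 & 0 \end{array}$$

Dempster’s rule cannot apply here because one gets 0/0 for all $m(\theta_i)$, $1 \le i \le 4$, but the DSm rules (classical or hybrid) work.

Using the DSm classical rule, one gets:

$$\begin{array}{ll}
m(\theta_1) = m(\theta_2) = m(\theta_3) = m(\theta_4) = 0 & m(\theta_1 \cup \theta_2) = m(\theta_3 \cup \theta_4) = 0 \\
m(\theta_1 \cap \theta_3) = 0.096 & m(\theta_1 \cap \theta_3 \cap (\theta_1 \cup \theta_2)) = m(\theta_1 \cap \theta_3) = 0.024 \\
m(\theta_1 \cap \theta_4) = 0.192 & m(\theta_1 \cap \theta_4 \cap (\theta_1 \cup \theta_2)) = m(\theta_1 \cap \theta_4) = 0.048 \\
m(\theta_1 \cap (\theta_3 \cup \theta_4)) = 0.032 & m(\theta_1 \cap (\theta_3 \cup \theta_4) \cap (\theta_1 \cup \theta_2)) = m(\theta_1 \cap (\theta_3 \cup \theta_4)) = 0.008 \\
m(\theta_2 \cap \theta_3 \cap \theta_1) = 0.120 & m(\theta_2 \cap \theta_3 \cap (\theta_1 \cup \theta_2)) = m(\theta_2 \cap \theta_3) = 0.030 \\
m(\theta_2 \cap \theta_4 \cap \theta_1) = 0.240 & m(\theta_2 \cap \theta_4 \cap (\theta_1 \cup \theta_2)) = m(\theta_2 \cap \theta_4) = 0.060 \\
m(\theta_2 \cap (\theta_3 \cup \theta_4) \cap \theta_1) = m((\theta_1 \cap \theta_2) \cap (\theta_3 \cup \theta_4)) = 0.040 & m(\theta_2 \cap (\theta_3 \cup \theta_4) \cap (\theta_1 \cup \theta_2)) = m(\theta_2 \cap (\theta_3 \cup \theta_4)) = 0.010 \\
m((\theta_1 \cup \theta_2) \cap \theta_3 \cap \theta_1) = m(\theta_1 \cap \theta_3) = 0.024 & m((\theta_1 \cup \theta_2) \cap \theta_3) = 0.006 \\
m((\theta_1 \cup \theta_2) \cap \theta_4 \cap \theta_1) = m(\theta_1 \cap \theta_4) = 0.048 & m((\theta_1 \cup \theta_2) \cap \theta_4) = 0.012 \\
m((\theta_1 \cup \theta_2) \cap (\theta_3 \cup \theta_4) \cap \theta_1) = m(\theta_1 \cap (\theta_3 \cup \theta_4)) = 0.008 & m((\theta_1 \cup \theta_2) \cap (\theta_3 \cup \theta_4)) = 0.002
\end{array}$$

After cumulating, one finally gets with the classic DSm rule:

$$\begin{array}{ll}
m(\theta_1 \cap \theta_3) = 0.096 + 0.024 + 0.024 = 0.144 & m(\theta_1 \cap \theta_4) = 0.192 + 0.048 + 0.048 = 0.288 \\
m(\theta_2 \cap \theta_3) = 0.030 & m(\theta_2 \cap \theta_4) = 0.060 \\
m(\theta_1 \cap \theta_2 \cap \theta_3) = 0.120 & m(\theta_1 \cap \theta_2 \cap \theta_4) = 0.240 \\
m((\theta_1 \cup \theta_2) \cap \theta_3) = 0.006 & m((\theta_1 \cup \theta_2) \cap \theta_4) = 0.012 \\
m(\theta_1 \cap (\theta_3 \cup \theta_4)) = 0.032 + 0.008 + 0.008 = 0.048 & m(\theta_1 \cap \theta_2 \cap (\theta_3 \cup \theta_4)) = 0.040 \\
m(\theta_2 \cap (\theta_3 \cup \theta_4)) = 0.010 & m((\theta_1 \cup \theta_2) \cap (\theta_3 \cup \theta_4)) = 0.002
\end{array}$$

Suppose now one finds out that all intersections are empty. Using the hybrid DSm rule one gets:

$$\begin{array}{ll}
m_h(\theta_1 \cup \theta_3) = 0.144 & m_h(\theta_1 \cup \theta_4) = 0.288 \\
m_h(\theta_2 \cup \theta_3) = 0.030 & m_h(\theta_2 \cup \theta_4) = 0.060 \\
m_h(\theta_1 \cup \theta_2 \cup \theta_3) = 0.120 + 0.006 = 0.126 & m_h(\theta_1 \cup \theta_2 \cup \theta_4) = 0.240 + 0.012 = 0.252 \\
m_h(\theta_1 \cup \theta_3 \cup \theta_4) = 0.048 & m_h(\theta_2 \cup \theta_3 \cup \theta_4) = 0.010 \\
m_h(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = 0.040 + 0.002 = 0.042 &
\end{array}$$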
5.2.2.3 More general


Let's consider the frame of discernment $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$, with $n \ge 2$, and $k$ experts, for $k \ge 2$, and the mass matrix $\mathbf{M}$ with $k$ rows and $n + u$ columns, where $u \ge 1$, corresponding to $\theta_1, \theta_2, \ldots, \theta_n$, and $u$ uncertainties $\theta_{i_1} \cup \ldots \cup \theta_{i_s}$, ..., $\theta_{j_1} \cup \ldots \cup \theta_{j_t}$ respectively.

If the following conditions occur:

• each column contains at least one zero;

• all uncertainties are different from the total ignorance $\theta_1 \cup \ldots \cup \theta_n$ (i.e., they are partial ignorances);

• the partial uncertainties are disjoint two by two;

• for each non-null uncertainty column $c_j$, $n + 1 \le j \le n + u$, of the form say $\theta_{p_1} \cup \ldots \cup \theta_{p_w}$, there exists a row such that all its elements on columns $p_1, \ldots, p_w$, and $c_j$ are zero;

then Dempster’s rule of combination cannot apply for such an infinite class of fusion problems because one gets 0/0 for all $m(\theta_i)$, $1 \le i \le n$. The DSm rules (classical or hybrid) work for such an infinite class of examples.


5.3 Second infinite class of counter examples 


This second class of counter-examples generalizes the famous Zadeh example given in [7] [8]. 


5.3.1 Zadeh’s example 


Two doctors examine a patient and agree that he suffers from either meningitis (M), contusion (C) or brain tumor (T). Thus $\Theta = \{M, C, T\}$. Assume that the doctors agree in their low expectation of a tumor, but disagree on the likely cause, and provide the following diagnoses:

$$m_1(M) = 0.99 \quad m_1(T) = 0.01 \qquad \text{and} \qquad m_2(C) = 0.99 \quad m_2(T) = 0.01$$

If we combine the two basic belief functions using Dempster’s rule of combination, one gets the unexpected final conclusion

$$m(T) = \frac{0.0001}{1 - 0.0099 - 0.0099 - 0.9801} = 1$$

which means that the patient suffers with certainty from a brain tumor! This unexpected result arises from the fact that the two bodies of evidence (doctors) agree that the patient most likely does not suffer from a tumor but are in almost full contradiction about the other causes of the disease. This very simple but interesting example shows the limitations of the practical use of the DST for automated reasoning.
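Zadeh's example can be replayed with a minimal Python sketch of Dempster's rule on Shafer's model. It is an illustration under stated assumptions: focal elements are `frozenset`s of hypothesis labels, exact `Fraction`s are used so that the tiny normalization constant is not blurred by floating-point rounding, and the function name `dempster` is hypothetical.

```python
from fractions import Fraction
from itertools import product

def dempster(m_a, m_b):
    """Dempster's rule on Shafer's model; focal elements are frozensets of labels."""
    fused, conflict = {}, Fraction(0)
    for (x, wx), (y, wy) in product(m_a.items(), m_b.items()):
        inter = x & y
        if inter:
            fused[inter] = fused.get(inter, Fraction(0)) + wx * wy
        else:
            conflict += wx * wy          # mass that would go to the empty set
    if conflict == 1:
        raise ZeroDivisionError("total conflict: Dempster's rule is undefined (0/0)")
    return {a: w / (1 - conflict) for a, w in fused.items()}, conflict

M, C, T = frozenset("M"), frozenset("C"), frozenset("T")
m1 = {M: Fraction(99, 100), T: Fraction(1, 100)}
m2 = {C: Fraction(99, 100), T: Fraction(1, 100)}

fused, conflict = dempster(m1, m2)
print(fused[T], float(conflict))   # prints: 1 0.9999
```

The degree of conflict is 0.9999, so the whole normalized belief rests on the 0.0001 of mass on which the two doctors happen to agree.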
This example has been examined in the literature by several authors to explain the anomaly of the result of Dempster’s rule of combination in such a case. Due to the high degree of conflict arising in such an extreme case, deliberately pointed out by Zadeh to show the weakness of this rule, it is often argued that in such a case the result of Dempster’s rule must not be taken directly without checking the level of conflict between the sources of evidence. This is trivially true, but there is no theoretical way to decide beforehand whether or not one can trust the result of such a rule of combination, especially in complex systems involving many sources and many hypotheses. This is one of its major drawbacks. The usual workaround consists in choosing, rather arbitrarily or heuristically, some threshold value on the degree of conflict between sources to accept or reject the result of the fusion [9]. Such an approach cannot be solidly justified from theoretical analysis. Assuming such a threshold is set to a given value, say 0.70 for instance, is it acceptable to reject the fusion result if the conflict appears to be 0.7001 and accept it when the conflict becomes 0.6999? What should be done when the fusion result is rejected and one has no assessment of the reliability of the sources, or when the sources have the same reliability/confidence but an important decision has to be taken anyway? There is no solid theoretical justification which can reasonably support such kinds of approaches commonly used in practice up to now.


The two major explanations of this problem found in the literature are mainly based either on the fact that the problem arises from the closed-world assumption of Shafer’s model $\Theta$, in which case it is suggested to work rather with an open-world model, and/or on the fact that the sources of evidence are not reliable. These explanations, although admissible, are not necessarily the only correct (sufficient) explanations. Note that the open-world assumption can always be easily relaxed advantageously by introducing a new hypothesis, say $\theta_0$, in the initial frame $\Theta = \{\theta_1, \ldots, \theta_n\}$ in order to close it. $\theta_0$ will then represent all possible alternatives (although remaining unknown) to the initial hypotheses $\theta_1, \ldots, \theta_n$. This idea has already been proposed by Yager in [6] through his hedging solution. According to our analysis, it is necessary neither to adopt the open-world model nor to admit the assumption about the reliability of the sources in order to find a justification for this counter-intuitive result. Actually, both sources can have the same reliability and Shafer’s model can be accepted for the combination of the two reports by using another rule of combination. This is exactly the purpose of the hybrid DSm rule of combination. Of course, when one has some prior information on the reliability of the sources, one has to take it into account properly by some discounting methods. The discounting techniques can also be applied in the DSmT framework, and there is no incompatibility in mixing both (i.e. discounting techniques with DSm rules of combination) when necessary (when there is a strong reason to justify doing it, i.e. when one has prior reliable information on the reliability of the sources). The discounting techniques must never be used as an artificial ad-hoc mechanism to update Dempster’s result once a problem has arisen. We strongly disagree with the idea that all problems with Dempster’s rule can be solved beforehand by discounting techniques. This can obviously help to improve the assessment of the belief functions to be combined when used properly and fairly, but it does not fundamentally solve the inherent problem of Dempster’s rule itself when the conflict remains high.


The problem comes from the fact that both sources provide their beliefs only with respect to their own
limited knowledge and experience. It is also possible, in some cases, that the sources of information
do not even share the same interpretation of the concepts included in the frame of the problem. Such
situations frequently appear, for example, in debates on TV or radio, or in most meetings where
important decisions/approvals have to be made and the sources do not share the same opinion. This is
what happens daily in real life, and one has to deal with such conflicting situations anyway. In other
words, the sources do not speak about the same events or, even if they do, they may not share the same
interpretation of these events. This has already been pointed out by Dubois and Prade in [3] (p. 256).
In Zadeh’s controversy example, it is possible that the first doctor is an expert mainly in meningitis
and brain tumor, while the second doctor is an expert mainly in cerebral contusion and brain tumor.
Because of their limited knowledge and experience, both doctors can also have the same reliability. If
they have been asked to give their reports only on $\Theta = \{M, C, T\}$ (but not on an extended frame),
their reports have to be taken with the same weight, and the combination has to be done anyway when one
has no solid reason to reject one report with respect to the other; the result of Dempster’s rule then
remains very questionable. No rational brain surgeon would decide on a brain intervention (i.e. a risky
tumor ablation) based on the result of Dempster’s rule, and neither would the family of the patient.
Therefore, according to our analysis, the two previous explanations given in the literature (although
possible and admissible in some cases) are not necessary and sufficient to explain the source of the
anomaly. Several alternatives to Dempster’s rule that circumvent this anomaly have been proposed in the
literature, mainly through the works of R. Yager [6] and of D. Dubois and H. Prade [2], already reported
in a previous chapter, or by Daniel in [1]. The DSmT offers a new way of solving such a controversy
example, as will be shown. In summary, extreme caution about the degree of conflict between the sources
must always be exercised before taking a final decision based on Dempster’s rule of combination,
especially when vital stakes are involved.


If we now adopt the free-DSm model, i.e. we replace the initial Shafer model by accepting the
possibility of non-null intersections between the hypotheses $M$, $C$ and $T$ and by working directly on
the hyper-power set $D^\Theta$, then one gets directly and easily the following result with the classical
DSm rule of combination:

$$m(M \cap C) = 0.9801 \qquad m(M \cap T) = 0.0099 \qquad m(C \cap T) = 0.0099 \qquad m(T) = 0.0001$$

which makes sense when working with such a new model. Obviously the same result can be obtained (the
proof is left to the reader) when working with Dempster’s rule based on the following refined frame
$\Theta_{ref}$, with basic belief functions defined on the power set $2^{\Theta_{ref}}$:

$$\Theta_{ref} = \{\theta_1 = M \cap C \cap T,\ \theta_2 = M \cap C \cap \bar{T},\ \theta_3 = M \cap \bar{C} \cap T,\ \theta_4 = \bar{M} \cap C \cap T,$$
$$\theta_5 = M \cap \bar{C} \cap \bar{T},\ \theta_6 = \bar{M} \cap C \cap \bar{T},\ \theta_7 = \bar{M} \cap \bar{C} \cap T\}$$

where $\bar{T}$, $\bar{C}$ and $\bar{M}$ denote respectively the complements of $T$, $C$ and $M$.

The equality of both results (i.e. by the classical DSm rule based on the free-DSm model and by
Dempster’s rule based on the refined frame) is normal, since the normalization factor $1 - k$ of
Dempster’s rule reduces to 1 in this case because of the choice of the new model. Based on this remark,
one could then try to argue that DSmT (together with its classical DSm rule for the free-DSm model) is
superfluous. Such a claim is obviously wrong, for the two following reasons: it is unnecessary to work
with a bigger space (keeping in mind that $|D^\Theta| < |2^{\Theta_{ref}}|$) to get the result (the DSm
rule offers a more direct and convenient way to get it), and in some fusion problems involving
vague/continuous concepts the refinement is simply impossible to obtain, so that we are unfortunately
forced to deal with ambiguous concepts/hypotheses (see [4] for details and justification).


If one has no doubt about the reliability of both doctors (or no way to assess it), if one is absolutely
sure that the true origin of the suffering of the patient lies only in the frame $\Theta = \{M, C, T\}$,
and if these origins are considered truly exclusive, then one has to work with the initial frame of
discernment $\Theta$ satisfying Shafer’s model. As previously shown, Dempster’s rule fails to provide a
reasonable and acceptable conclusion in such a highly conflicting case. However, this case can easily be
handled by the hybrid DSm rule of combination. The hybrid DSm rule applies here because Shafer’s model
is nothing but a particular hybrid model including all exclusivity constraints between the hypotheses of
the frame $\Theta$ (see chapter 4 for details). For this simple case (more general and complex examples
have already been presented in chapter 4), after the proper mass transfer of all sources of conflict,
the hybrid DSm rule gives:

$$m(M \cup C) = 0.9801 \qquad m(M \cup T) = 0.0099 \qquad m(C \cup T) = 0.0099 \qquad m(T) = 0.0001$$

This result is not surprising and agrees perfectly with common intuition, since it provides a coherent
and reasonable solution to the problem. It clearly shows that a brain intervention for the ablation of a
hypothetical tumor is not recommended; a better examination of the patient, focused on meningitis or
contusion as the possible source of the suffering, is preferable. The consequences of the results of
Dempster’s rule and of the hybrid DSm rule are therefore totally different.
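
The two behaviours compared above can be reproduced with a short sketch. It assumes the masses of Zadeh's example, m1(M) = 0.99, m1(T) = 0.01, m2(C) = 0.99, m2(T) = 0.01, and it assumes that, for two sources under Shafer's model, the hybrid DSm rule sends each conflicting product to the union of the two propositions involved, which is what the result above shows. The function names `dempster` and `hybrid_dsm_shafer` are illustrative only.

```python
from itertools import product

def dempster(m1, m2):
    """Dempster's rule for two sources; focal elements are frozensets of hypotheses."""
    fused, conflict = {}, 0.0
    for (x, a), (y, b) in product(m1.items(), m2.items()):
        common = x & y
        if common:
            fused[common] = fused.get(common, 0.0) + a * b
        else:
            conflict += a * b  # mass of the empty intersections
    # Normalization by 1 - conflict (undefined when the conflict is total).
    return {s: v / (1.0 - conflict) for s, v in fused.items()}, conflict

def hybrid_dsm_shafer(m1, m2):
    """Two-source sketch under Shafer's model: conflicting mass goes to the union."""
    fused = {}
    for (x, a), (y, b) in product(m1.items(), m2.items()):
        target = (x & y) or (x | y)
        fused[target] = fused.get(target, 0.0) + a * b
    return fused

M, C, T = frozenset("M"), frozenset("C"), frozenset("T")
doctor1 = {M: 0.99, T: 0.01}
doctor2 = {C: 0.99, T: 0.01}

dempster_result, conflict = dempster(doctor1, doctor2)
hybrid_result = hybrid_dsm_shafer(doctor1, doctor2)
print(round(conflict, 4), round(dempster_result[T], 6))
print({"".join(sorted(s)): round(v, 4) for s, v in hybrid_result.items()})
```

With a conflict of about 0.9999, Dempster's normalization puts all the remaining mass on T, while the union transfer keeps 0.9801 on M ∪ C.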
5.3.2 Generalization with $\Theta = \{\theta_1, \theta_2, \theta_3\}$


Let $0 < \epsilon_1, \epsilon_2 < 1$ be two very tiny positive numbers (close to zero), let the frame of
discernment be $\Theta = \{\theta_1, \theta_2, \theta_3\}$, and let two experts (independent sources of
evidence $s_1$ and $s_2$) give the belief masses

$$m_1(\theta_1) = 1 - \epsilon_1 \qquad m_1(\theta_2) = 0 \qquad m_1(\theta_3) = \epsilon_1$$
$$m_2(\theta_1) = 0 \qquad m_2(\theta_2) = 1 - \epsilon_2 \qquad m_2(\theta_3) = \epsilon_2$$

From now on, we prefer to use matrices to describe the masses, i.e.

$$\begin{bmatrix} 1 - \epsilon_1 & 0 & \epsilon_1 \\ 0 & 1 - \epsilon_2 & \epsilon_2 \end{bmatrix}$$

• Using Dempster’s rule of combination, one gets

$$m(\theta_3) = \frac{\epsilon_1 \epsilon_2}{(1 - \epsilon_1) \cdot 0 + 0 \cdot (1 - \epsilon_2) + \epsilon_1 \epsilon_2} = 1$$

which is absurd (or at least counter-intuitive). Note that whatever the positive values of
$\epsilon_1$, $\epsilon_2$ are, Dempster’s rule of combination always provides the same result (one),
which is abnormal. The only acceptable and correct result obtained by Dempster’s rule occurs in the
trivial case $\epsilon_1 = \epsilon_2 = 1$, i.e. when both sources agree on $\theta_3$ with certainty,
which is obvious.

• Using the DSm rule of combination based on the free-DSm model, one gets
$m(\theta_3) = \epsilon_1 \epsilon_2$, $m(\theta_1 \cap \theta_2) = (1 - \epsilon_1)(1 - \epsilon_2)$,
$m(\theta_1 \cap \theta_3) = (1 - \epsilon_1)\epsilon_2$, $m(\theta_2 \cap \theta_3) = (1 - \epsilon_2)\epsilon_1$,
and the others are zero, which appears more reliable/trustworthy.

• Going back to Shafer’s model and using the hybrid DSm rule of combination, one gets
$m(\theta_3) = \epsilon_1 \epsilon_2$, $m(\theta_1 \cup \theta_2) = (1 - \epsilon_1)(1 - \epsilon_2)$,
$m(\theta_1 \cup \theta_3) = (1 - \epsilon_1)\epsilon_2$, $m(\theta_2 \cup \theta_3) = (1 - \epsilon_2)\epsilon_1$,
and the others are zero.

Note that in the special case when $\epsilon_1 = \epsilon_2 = 1/2$, one has

$$m_1(\theta_1) = 1/2 \quad m_1(\theta_2) = 0 \quad m_1(\theta_3) = 1/2 \qquad \text{and} \qquad m_2(\theta_1) = 0 \quad m_2(\theta_2) = 1/2 \quad m_2(\theta_3) = 1/2$$

Dempster’s rule of combination still yields $m(\theta_3) = 1$, while the hybrid DSm rule based on the
same Shafer’s model now yields $m(\theta_3) = 1/4$, $m(\theta_1 \cup \theta_2) = 1/4$,
$m(\theta_1 \cup \theta_3) = 1/4$, $m(\theta_2 \cup \theta_3) = 1/4$, which is normal.


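5.3.3 Generalization with $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$


Let $0 < \epsilon_1, \epsilon_2, \epsilon_3 < 1$ be three very tiny positive numbers, let the frame of
discernment be $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$, and let two experts give the mass
matrix

$$\begin{bmatrix} 1 - \epsilon_1 - \epsilon_2 & 0 & \epsilon_1 & \epsilon_2 \\ 0 & 1 - \epsilon_3 & 0 & \epsilon_3 \end{bmatrix}$$

Again, using Dempster’s rule of combination, one gets $m(\theta_4) = 1$, which is absurd, while using the
DSm rule of combination based on the free-DSm model, one gets $m(\theta_4) = \epsilon_2 \epsilon_3$, which
is reliable. Using the classical DSm rule:
$m(\theta_1 \cap \theta_2) = (1 - \epsilon_1 - \epsilon_2)(1 - \epsilon_3)$,
$m(\theta_1 \cap \theta_4) = (1 - \epsilon_1 - \epsilon_2)\epsilon_3$,
$m(\theta_3 \cap \theta_2) = \epsilon_1 (1 - \epsilon_3)$, $m(\theta_3 \cap \theta_4) = \epsilon_1 \epsilon_3$,
$m(\theta_4 \cap \theta_2) = \epsilon_2 (1 - \epsilon_3)$, $m(\theta_4) = \epsilon_2 \epsilon_3$.
Suppose one finds out that all intersections are empty; then one applies the hybrid DSm rule:
$m_h(\theta_1 \cup \theta_2) = (1 - \epsilon_1 - \epsilon_2)(1 - \epsilon_3)$,
$m_h(\theta_1 \cup \theta_4) = (1 - \epsilon_1 - \epsilon_2)\epsilon_3$,
$m_h(\theta_3 \cup \theta_2) = \epsilon_1 (1 - \epsilon_3)$, $m_h(\theta_3 \cup \theta_4) = \epsilon_1 \epsilon_3$,
$m_h(\theta_4 \cup \theta_2) = \epsilon_2 (1 - \epsilon_3)$, $m_h(\theta_4) = \epsilon_2 \epsilon_3$.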


5.3.4 More general 


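Let $0 < \epsilon_1, \ldots, \epsilon_n < 1$ be very tiny positive numbers, let the frame of discernment
be $\Theta = \{\theta_1, \ldots, \theta_n, \theta_{n+1}\}$, and let two experts give the mass matrix

$$\begin{bmatrix} 1 - S_1^p & 0 & \epsilon_1 & 0 & \epsilon_2 & \ldots & 0 & \epsilon_p \\ 0 & 1 - S_{p+1}^n & 0 & \epsilon_{p+1} & 0 & \ldots & \epsilon_{n-1} & \epsilon_n \end{bmatrix}$$

where $1 \le p \le n$, $S_1^p \triangleq \sum_{i=1}^{p} \epsilon_i$ and
$S_{p+1}^n \triangleq \sum_{i=p+1}^{n} \epsilon_i$. Again, using Dempster’s rule of combination, one gets
$m(\theta_{n+1}) = 1$, which is absurd, while using the DSm rule of combination based on the free-DSm
model, one gets $m(\theta_{n+1}) = \epsilon_p \epsilon_n$, which is reliable. This example is similar to
the previous one, but generalized.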


5.3.5 Even more general 


Let $0 < \epsilon_1, \ldots, \epsilon_n < 1$ be very tiny positive numbers (close to zero), let the frame
of discernment be $\Theta = \{\theta_1, \ldots, \theta_n, \theta_{n+1}\}$, and let $k \ge 2$ experts give
the mass matrix of $k$ rows and $n + 1$ columns such that:

• one column, say column $j$, is $(\epsilon_{j_1}, \epsilon_{j_2}, \ldots, \epsilon_{j_k})'$ (transposed
vector), where $1 \le j \le n + 1$ and $\{\epsilon_{j_1}, \epsilon_{j_2}, \ldots, \epsilon_{j_k}\}$ is
included in $\{\epsilon_1, \epsilon_2, \ldots, \epsilon_n\}$;
• each column (except column $j$) contains at least one element equal to zero.

Then Dempster’s rule of combination gives $m(\theta_j) = 1$, which is absurd, while the classical DSm rule
gives $m(\theta_j) = \epsilon_{j_1} \cdot \epsilon_{j_2} \cdot \ldots \cdot \epsilon_{j_k} \neq 0$, which
is reliable.

Actually, we need to require only that $\epsilon_{j_1}, \epsilon_{j_2}, \ldots, \epsilon_{j_k}$ be very
tiny positive numbers, not all of $\epsilon_1, \epsilon_2, \ldots, \epsilon_n$ (the others can be anything
in the interval $[0, 1)$ such that the sum of the elements on each row is equal to 1).
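
The general statement can be checked numerically with a minimal sketch. It assumes that every expert commits mass only to singletons of an exclusive frame, so that Dempster's rule reduces to normalized column-wise products. The matrix below is a hypothetical instance (k = 3 experts, 4 hypotheses) chosen to satisfy the two conditions, and the helper names are illustrative.

```python
import math

def dempster_singletons(mass_matrix):
    """Dempster's rule when all focal elements are singletons of an exclusive frame.

    mass_matrix[i][j] is the mass given by expert i to hypothesis j. Distinct
    singletons never intersect, so only the column-wise products survive; they
    are renormalized by their sum, which equals 1 minus the conflict.
    """
    products = [math.prod(column) for column in zip(*mass_matrix)]
    total = sum(products)
    if total == 0:
        raise ZeroDivisionError("total conflict: Dempster's rule is undefined")
    return [p / total for p in products]

def classical_dsm_column(mass_matrix, j):
    """Classical DSm mass left on hypothesis j: the product of column j."""
    return math.prod(row[j] for row in mass_matrix)

# Hypothetical matrix: each row sums to 1, every column except the last has a zero.
masses = [
    [0.90, 0.00, 0.09, 0.01],
    [0.00, 0.97, 0.01, 0.02],
    [0.49, 0.50, 0.00, 0.01],
]
fused = dempster_singletons(masses)
tiny = classical_dsm_column(masses, 3)
print(fused)  # all the mass ends up on the last hypothesis
print(tiny)   # product of the three tiny masses, far from 1
```

Whatever tiny values are placed in the last column, the normalization divides their product by itself, which is why Dempster's rule always returns one.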


5.4 Third infinite class of counter examples 


This third class of counter-examples deals with belief functions committing a non-null mass to some
uncertainties.
5.4.1 Example with $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$


Let's consider $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$, two independent experts, and the
mass matrix:

$$\begin{array}{c|ccc} & \theta_1 & \theta_2 & \theta_3 \cup \theta_4 \\ \hline m_1(.) & 0.99 & 0 & 0.01 \\ m_2(.) & 0 & 0.98 & 0.02 \end{array}$$

If one applies Dempster’s rule, one gets

$$m(\theta_3 \cup \theta_4) = \frac{0.01 \cdot 0.02}{0 + 0 + 0 + 0 + 0.01 \cdot 0.02} = 1$$

(total ignorance), which doesn’t bring any information to the fusion. This example looks similar to
Zadeh’s example, but is different because it refers to an uncertain (not a contradictory) result.
Using the classical DSm rule: $m(\theta_1 \cap \theta_2) = 0.9702$,
$m(\theta_1 \cap (\theta_3 \cup \theta_4)) = 0.0198$, $m(\theta_2 \cap (\theta_3 \cup \theta_4)) = 0.0098$,
$m(\theta_3 \cup \theta_4) = 0.0002$. Suppose now one finds out that all intersections are empty (i.e. one
adopts Shafer’s model). Using the hybrid DSm rule one gets: $m_h(\theta_1 \cup \theta_2) = 0.9702$,
$m_h(\theta_1 \cup \theta_3 \cup \theta_4) = 0.0198$, $m_h(\theta_2 \cup \theta_3 \cup \theta_4) = 0.0098$,
$m_h(\theta_3 \cup \theta_4) = 0.0002$.


5.4.2 Example with $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4, \theta_5\}$


Let's consider $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4, \theta_5\}$, three independent experts,
and the mass matrix:

$$\begin{array}{c|cccc} & \theta_1 & \theta_2 & \theta_3 & \theta_4 \cup \theta_5 \\ \hline m_1(.) & 0.99 & 0 & 0 & 0.01 \\ m_2(.) & 0 & 0.98 & 0.01 & 0.01 \\ m_3(.) & 0.01 & 0.01 & 0.97 & 0.01 \end{array}$$

• If one applies Dempster’s rule, one gets

$$m(\theta_4 \cup \theta_5) = \frac{0.01 \cdot 0.01 \cdot 0.01}{0 + 0 + 0 + 0.01 \cdot 0.01 \cdot 0.01} = 1$$

(total ignorance), which doesn’t bring any information to the fusion.

• Using the classical DSm rule one gets:

$m(\theta_1 \cap \theta_2) = 0.99 \cdot 0.98 \cdot 0.01 + 0.99 \cdot 0.98 \cdot 0.01 = 0.019404$
$m(\theta_1 \cap \theta_3) = 0.99 \cdot 0.01 \cdot 0.01 + 0.99 \cdot 0.01 \cdot 0.97 = 0.009702$
$m(\theta_1 \cap \theta_2 \cap \theta_3) = 0.99 \cdot 0.98 \cdot 0.97 + 0.99 \cdot 0.01 \cdot 0.01 = 0.941193$
$m(\theta_1 \cap \theta_3 \cap (\theta_4 \cup \theta_5)) = 0.99 \cdot 0.01 \cdot 0.01 + 0.99 \cdot 0.01 \cdot 0.97 + 0.01 \cdot 0.01 \cdot 0.01 = 0.009703$
$m(\theta_1 \cap (\theta_4 \cup \theta_5)) = 0.99 \cdot 0.01 \cdot 0.01 + 0.99 \cdot 0.01 \cdot 0.01 + 0.01 \cdot 0.01 \cdot 0.01 = 0.000199$
$m((\theta_4 \cup \theta_5) \cap \theta_2 \cap \theta_1) = 0.01 \cdot 0.98 \cdot 0.01 + 0.99 \cdot 0.01 \cdot 0.01 + 0.99 \cdot 0.98 \cdot 0.01 = 0.009899$
$m((\theta_4 \cup \theta_5) \cap \theta_2) = 0.01 \cdot 0.98 \cdot 0.01 + 0.01 \cdot 0.98 \cdot 0.01 + 0.01 \cdot 0.01 \cdot 0.01 = 0.000197$
$m((\theta_4 \cup \theta_5) \cap \theta_2 \cap \theta_3) = 0.01 \cdot 0.98 \cdot 0.97 + 0.01 \cdot 0.01 \cdot 0.01 = 0.009507$
$m((\theta_4 \cup \theta_5) \cap \theta_3) = 0.01 \cdot 0.01 \cdot 0.97 + 0.01 \cdot 0.01 \cdot 0.01 + 0.01 \cdot 0.01 \cdot 0.97 = 0.000195$
$m(\theta_4 \cup \theta_5) = 0.01 \cdot 0.01 \cdot 0.01 = 0.000001$

The sum of all masses is 1.


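• Suppose now one finds out that all intersections are empty (Shafer’s model), then one uses the hybrid
DSm rule and one gets:

$m_h(\theta_1 \cup \theta_2) = 0.019404$ $\qquad$ $m_h(\theta_1 \cup \theta_3) = 0.009702$
$m_h(\theta_1 \cup \theta_2 \cup \theta_3) = 0.941193$ $\qquad$ $m_h(\theta_1 \cup \theta_3 \cup \theta_4 \cup \theta_5) = 0.009703$
$m_h(\theta_1 \cup \theta_4 \cup \theta_5) = 0.000199$ $\qquad$ $m_h(\theta_4 \cup \theta_5 \cup \theta_2 \cup \theta_1) = 0.009899$
$m_h(\theta_4 \cup \theta_5 \cup \theta_2) = 0.000197$ $\qquad$ $m_h(\theta_4 \cup \theta_5 \cup \theta_2 \cup \theta_3) = 0.009507$
$m_h(\theta_4 \cup \theta_5 \cup \theta_3) = 0.000195$ $\qquad$ $m_h(\theta_4 \cup \theta_5) = 0.000001$

The sum of all masses is 1.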
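
The ten masses above can be recomputed by brute-force enumeration of the 2 · 3 · 4 products of focal masses. The sketch below is only an illustration under one simplifying assumption: the four focal elements θ1, θ2, θ3 and θ4 ∪ θ5 are treated as opaque labels, so a fused proposition is identified by the set of distinct labels involved. That set is read as an intersection in the free-DSm model and as a union once all intersections are declared empty. The names `experts` and `fuse_by_labels` are hypothetical.

```python
from itertools import product

U = "theta4 u theta5"
experts = [
    {"theta1": 0.99, U: 0.01},
    {"theta2": 0.98, "theta3": 0.01, U: 0.01},
    {"theta1": 0.01, "theta2": 0.01, "theta3": 0.97, U: 0.01},
]

def fuse_by_labels(sources):
    """Sum the products of masses, grouped by the set of distinct labels combined."""
    fused = {}
    for combo in product(*(s.items() for s in sources)):
        key = frozenset(label for label, _ in combo)
        weight = 1.0
        for _, mass in combo:
            weight *= mass
        fused[key] = fused.get(key, 0.0) + weight
    return fused

fused = fuse_by_labels(experts)
for key, mass in sorted(fused.items(), key=lambda kv: -kv[1]):
    print(sorted(key), round(mass, 6))
print(round(sum(fused.values()), 6))
```

The grouping works here because no focal element contains another one; nested focal elements would need a real set representation of the propositions.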


5.4.3 More general 


Let $\Theta = \{\theta_1, \ldots, \theta_n\}$, where $n \ge 2$, $k$ independent experts, $k \ge 2$, and the
mass matrix $M$ of $k$ rows and $n + 1$ columns, corresponding to $\theta_1, \theta_2, \ldots, \theta_n$
and one uncertainty (different from the total uncertainty
$\theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$), say $\theta_{i_1} \cup \ldots \cup \theta_{i_s}$,
respectively. If the following conditions occur:

• each column contains at least one zero, except the last column (of uncertainties), which has only
non-null elements $0 < \epsilon_1, \epsilon_2, \ldots, \epsilon_k < 1$, very tiny numbers (close to zero);
• the columns corresponding to the elements $\theta_{i_1}, \ldots, \theta_{i_s}$ are null (all their
elements are equal to zero).

If one applies Dempster’s rule, one gets $m(\theta_{i_1} \cup \ldots \cup \theta_{i_s}) = 1$ (total
ignorance), which doesn’t bring any information to the fusion.


5.4.4 Even more general 


One can extend the previous case even more by considering $u$ uncertainty columns, $u \ge 1$, as follows.

Let $\Theta = \{\theta_1, \ldots, \theta_n\}$, where $n \ge 2$, $k$ independent experts, $k \ge 2$, and the
mass matrix $M$ of $k$ rows and $n + u$ columns, corresponding to $\theta_1, \theta_2, \ldots, \theta_n$
and $u$ uncertainty columns (different from the total uncertainty
$\theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$) respectively. If the following conditions occur:

• each column contains at least one zero, except one column among the last $u$ uncertainty ones, which
has only non-null elements $0 < \epsilon_1, \epsilon_2, \ldots, \epsilon_k < 1$, very tiny numbers (close
to zero);
• the columns corresponding to all elements
$\theta_{i_1}, \ldots, \theta_{i_s}, \ldots, \theta_{r_1}, \ldots, \theta_{r_s}$ (of course, these elements
should not be all of $\theta_1, \theta_2, \ldots, \theta_n$, but only a part of them) that occur in all
uncertainties are null (i.e., all their elements are equal to zero).

If one applies Dempster’s rule, one gets $m(\theta_{i_1} \cup \ldots \cup \theta_{i_s}) = 1$ (total
ignorance), which doesn’t bring any information to the fusion.


5.5 Fourth infinite class of counter examples 


This infinite class of counter-examples concerns Dempster’s rule of conditioning, defined as:

$$\forall B \in 2^\Theta, \qquad m(B|A) = \frac{\sum_{X, Y \in 2^\Theta,\ (X \cap Y) = B} m(X)\, m_A(Y)}{1 - \sum_{X, Y \in 2^\Theta,\ (X \cap Y) = \emptyset} m(X)\, m_A(Y)}$$

where $m(.)$ is any proper basic belief function defined over $2^\Theta$ and $m_A(.)$ is a particular
belief function defined by choosing $m_A(A) = 1$ for any $A \in 2^\Theta$ with $A \neq \emptyset$.


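5.5.1 Example with $\Theta = \{\theta_1, \ldots, \theta_6\}$


Let's consider $\Theta = \{\theta_1, \ldots, \theta_6\}$, one expert and a certain body of evidence over
$\theta_2$, with the mass matrix:

$$\begin{array}{c|ccccc} & \theta_1 & \theta_2 & \theta_3 & \theta_4 \cup \theta_5 & \theta_5 \cup \theta_6 \\ \hline m_1(.) & 0.3 & 0 & 0.4 & 0.2 & 0.1 \\ m_{\theta_2}(.) & 0 & 1 & 0 & 0 & 0 \end{array}$$

• Using Dempster’s rule of conditioning, one gets: $m(.|\theta_2) = 0/0$ for all the masses.

• Using the classical DSm rule, one gets:

$$m(\theta_1 \cap \theta_2 | \theta_2) = 0.3 \quad m(\theta_2 \cap \theta_3 | \theta_2) = 0.4 \quad m(\theta_2 \cap (\theta_4 \cup \theta_5) | \theta_2) = 0.2 \quad m(\theta_2 \cap (\theta_5 \cup \theta_6) | \theta_2) = 0.1$$

• If now one finds out that all intersections are empty (we adopt Shafer’s model), then using the hybrid
DSm rule, one gets:

$$m_h(\theta_1 \cup \theta_2 | \theta_2) = 0.3 \quad m_h(\theta_2 \cup \theta_3 | \theta_2) = 0.4 \quad m_h(\theta_2 \cup \theta_4 \cup \theta_5 | \theta_2) = 0.2 \quad m_h(\theta_2 \cup \theta_5 \cup \theta_6 | \theta_2) = 0.1$$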
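
The 0/0 behaviour can be made explicit with exact arithmetic. The following sketch assumes the masses 0.3, 0.4, 0.2 and 0.1 on θ1, θ3, θ4 ∪ θ5 and θ5 ∪ θ6 that appear in the results above, encodes each proposition as a frozenset of hypothesis indices, and uses `fractions.Fraction` so that the test "conflict equals 1" is exact rather than subject to floating-point rounding. The function name `dempster_condition` and the second call (conditioning on θ5) are illustrative additions.

```python
from fractions import Fraction

def dempster_condition(m, a):
    """Dempster's rule of conditioning m(.|a); returns None in the 0/0 case."""
    numerators, conflict = {}, Fraction(0)
    for x, p in m.items():
        common = x & a           # m_A puts its whole mass on the single set a
        if common:
            numerators[common] = numerators.get(common, Fraction(0)) + p
        else:
            conflict += p
    if conflict == 1:
        return None              # numerator 0 and denominator 1 - 1 = 0
    return {b: v / (1 - conflict) for b, v in numerators.items()}

m1 = {
    frozenset({1}): Fraction(3, 10),
    frozenset({3}): Fraction(4, 10),
    frozenset({4, 5}): Fraction(2, 10),
    frozenset({5, 6}): Fraction(1, 10),
}

on_theta2 = dempster_condition(m1, frozenset({2}))
on_theta5 = dempster_condition(m1, frozenset({5}))
print(on_theta2)  # None: every focal element of m1 is disjoint from theta2
print(on_theta5)  # conditioning on theta5 is defined and puts all mass on theta5
```

Conditioning fails exactly when no focal element of the prior mass intersects the conditioning event, which is the situation built into this class of counter-examples.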


5.5.2 Another example with $\Theta = \{\theta_1, \ldots, \theta_6\}$


Let's change the previous counter-example and use now the following mass matrix:

$$\begin{array}{c|ccccc} & \theta_1 & \theta_2 & \theta_3 & \theta_4 \cup \theta_5 & \theta_5 \cup \theta_6 \\ \hline m_1(.) & 1 & 0 & 0 & 0 & 0 \\ m_{\theta_2}(.) & 0 & 1 & 0 & 0 & 0 \end{array}$$

• Using Dempster’s rule of conditioning, one gets: $m(.|\theta_2) = 0/0$ for all the masses.

• Using the classical DSm rule, one gets: $m(\theta_1 \cap \theta_2 | \theta_2) = 1$, and others 0.

• If now one finds out that all intersections are empty (we adopt Shafer’s model), then using the hybrid
DSm rule, one gets: $m_h(\theta_1 \cup \theta_2 | \theta_2) = 1$, and others 0.


5.5.3 Generalization 


Let $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$, where $n \ge 2$, and two basic belief
functions/masses $m_1(.)$ and $m_2(.)$ such that there exist $1 \le (i \neq j) \le n$, where
$m_1(\theta_i) = m_2(\theta_j) = 1$, and 0 otherwise. Then Dempster’s rule of conditioning cannot be
applied because one gets a division by zero.


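5.5.4 Example with $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$ and ignorance


Let's consider $\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4\}$, one expert and a certain ignorant
body of evidence over $\theta_3 \cup \theta_4$, with the mass matrix:

$$\begin{array}{c|ccc} & \theta_1 & \theta_2 & \theta_3 \cup \theta_4 \\ \hline m_1(.) & 0.3 & 0.7 & 0 \\ m_{\theta_3 \cup \theta_4}(.) & 0 & 0 & 1 \end{array}$$

• Using Dempster’s rule of conditioning, one gets 0/0 for all masses $m(.|\theta_3 \cup \theta_4)$.

• Using the classical DSm rule, one gets:
$m(\theta_1 \cap (\theta_3 \cup \theta_4) | \theta_3 \cup \theta_4) = 0.3$,
$m(\theta_2 \cap (\theta_3 \cup \theta_4) | \theta_3 \cup \theta_4) = 0.7$ and others 0.

• If now one finds out that all intersections are empty (Shafer’s model), using the hybrid DSm rule, one
gets $m_h(\theta_1 \cup \theta_3 \cup \theta_4 | \theta_3 \cup \theta_4) = 0.3$,
$m_h(\theta_2 \cup \theta_3 \cup \theta_4 | \theta_3 \cup \theta_4) = 0.7$ and others 0.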


5.5.5 Generalization 


Let $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n, \theta_{n+1}, \ldots, \theta_{n+m}\}$, for $n \ge 2$
and $m \ge 2$. Let's consider the mass $m_1(.)$, which is a row of its values assigned to
$\theta_1, \theta_2, \ldots, \theta_n$ and to some unions among the elements
$\theta_{n+1}, \ldots, \theta_{n+m}$ such that all unions are disjoint from each other. If the second mass
$m_A(.)$ is a conditional mass, where $A$ belongs to $\{\theta_1, \theta_2, \ldots, \theta_n\}$ or to the
unions among $\theta_{n+1}, \ldots, \theta_{n+m}$, such that $m_1(A) = 0$, then Dempster’s rule of
conditioning cannot be applied because one gets a division by zero, which is undefined. [We did not
consider any intersection of the $\theta_i$ because Dempster’s rule of conditioning doesn’t accept
paradoxes.] But the DSm rule of conditioning does work here as well.
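5.5.6 Example with a paradoxical source


A counter-example with a paradox (intersection) over a non-refinable frame, where Dempster’s rule of
conditioning cannot be applied because Dempster-Shafer theory does not accept paradoxist/conflicting
information between the elementary elements $\theta_i$ of the frame $\Theta$:

Let's consider the frame of discernment $\Theta = \{\theta_1, \theta_2\}$, one expert and a certain body
of evidence over $\theta_2$, with the mass matrix:

$$\begin{array}{c|cccc} & \theta_1 & \theta_2 & \theta_1 \cap \theta_2 & \theta_1 \cup \theta_2 \\ \hline m_1(.) & 0.2 & 0.1 & 0.4 & 0.3 \\ m_{\theta_2}(.) & 0 & 1 & 0 & 0 \end{array}$$

Using the DSm rule of conditioning, one gets

$$m(\theta_1 | \theta_2) = 0 \qquad m(\theta_2 | \theta_2) = 0.4 \qquad m(\theta_1 \cap \theta_2 | \theta_2) = 0.6 \qquad m(\theta_1 \cup \theta_2 | \theta_2) = 0$$

and the sum of fusion results is equal to 1.

Suppose now one finds out that all intersections are empty. Using the hybrid DSm rule when
$\theta_1 \cap \theta_2 = \emptyset$, and writing $m_2(.)$ for $m_{\theta_2}(.)$, one has:

$$m_h(\theta_1 \cap \theta_2 | \theta_2) = 0$$

$$m_h(\theta_1 | \theta_2) = m(\theta_1 | \theta_2) + [m_1(\theta_1) m_2(\theta_1 \cap \theta_2) + m_2(\theta_1) m_1(\theta_1 \cap \theta_2)] = 0$$

$$m_h(\theta_2 | \theta_2) = m(\theta_2 | \theta_2) + [m_1(\theta_2) m_2(\theta_1 \cap \theta_2) + m_2(\theta_2) m_1(\theta_1 \cap \theta_2)] = 0.4 + 0.1(0) + 1(0.4) = 0.8$$

$$m_h(\theta_1 \cup \theta_2 | \theta_2) = m(\theta_1 \cup \theta_2 | \theta_2) + [m_1(\theta_1) m_2(\theta_2) + m_2(\theta_1) m_1(\theta_2)]$$
$$\qquad + [m_1(\theta_1 \cap \theta_2) m_2(\theta_1 \cup \theta_2) + m_2(\theta_1 \cap \theta_2) m_1(\theta_1 \cup \theta_2)] + [m_1(\theta_1 \cap \theta_2) m_2(\theta_1 \cap \theta_2)]$$
$$\qquad = 0 + [0.2(1) + 0(0.1)] + [0.4(0) + 0(0.3)] + [0.4(0)]$$
$$\qquad = 0.2 + [0] + [0] + [0] = 0.2$$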


5.6 Conclusion 


Several infinite classes of counter-examples to Dempster’s rule of combination have been presented in
this chapter for didactic purposes, to show the limitations of this rule in the DST framework. These
infinite classes of fusion problems show the necessity of generalizing the DST to a more flexible theory
which permits the combination of any kind of sources of information, with any degree of conflict, and
which works on any frame with exclusive or non-exclusive elements. The DSmT with the hybrid DSm rule of
combination offers a new way to satisfy these requirements, based on a new mathematical framework.
Chapter 6


Fusion of imprecise beliefs

29 Av. de la Division Leclerc, 92320 Châtillon, France
University of New Mexico, Gallup, NM 8730, U.S.A.


Abstract: In this chapter one studies, within the DSmT framework, the case when the sources of
information provide imprecise belief functions/masses, and we generalize the DSm rules of combination
(classic or hybrid rules) from scalar fusion to sub-unitary interval fusion and, more generally, to any
set of sub-unitary interval fusion. This work generalizes previous works available in the literature,
which appear limited to IBS (Interval-valued Belief Structures) in the Transferable Belief Model
framework. Numerical didactic examples of these new DSm fusion rules for dealing with imprecise
information are also presented.
6.1 Introduction 


In the previous chapters, we had focused our efforts on the fusion of precise uncertain and
conflicting/paradoxical generalized basic belief assignments (gbba). By precise gbba we mean basic
belief functions/masses $m(.)$ defined precisely on the hyper-power set $D^\Theta$, where each mass
$m(X)$, for $X$ belonging to $D^\Theta$, is represented by only one real number belonging to $[0, 1]$
such that $\sum_{X \in D^\Theta} m(X) = 1$. In this chapter, we extend the DSm fusion rules for dealing
with admissible imprecise generalized basic belief assignments $m^I(.)$ defined as real sub-unitary
intervals of $[0, 1]$, or even more generally as real sub-unitary sets (i.e. sets, not necessarily
intervals). An imprecise belief assignment $m^I(.)$ over $D^\Theta$ is said to be admissible if and only
if there exists for every $X \in D^\Theta$ at least one real number $m(X) \in m^I(X)$ such that
$\sum_{X \in D^\Theta} m(X) = 1$. The idea of working with imprecise belief structures represented by real
subset intervals of $[0, 1]$ is not new, and we strongly encourage the reader to examine the previous
works of Lamata & Moral and also Denœux, for instance, on this topic in [5], [1], [2] and references
therein. To our knowledge, the works available in the literature were limited to sub-unitary interval
combination in the framework of the Transferable Belief Model (TBM) developed by Smets [12], [13]. We
extend the approach of Lamata & Moral and Denœux, based on sub-unitary interval-valued masses, to
sub-unitary set-valued masses; therefore the closed intervals used by Denœux to denote imprecise masses
are generalized to any sets included in $[0, 1]$, i.e. in our case these sets can be unions of (closed,
open, or half-open/half-closed) intervals and/or scalars, all in $[0, 1]$. In this work, the proposed
extension is done in the context of the DSmT framework, although it can also apply directly to the
fusion of IBS within TBM if the user prefers to adopt TBM rather than DSmT.
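
For the special case where every imprecise mass is a closed interval, admissibility can be tested with two sums: a selection of points summing to 1 exists exactly when the lower bounds sum to at most 1 and the upper bounds sum to at least 1. The sketch below relies on that assumption (closed intervals only; unions of intervals and open endpoints are not handled), and the function name, the tolerance and the example values are hypothetical.

```python
def is_admissible(interval_masses, tol=1e-12):
    """Check admissibility of closed-interval masses {proposition: (low, high)}."""
    for name, (low, high) in interval_masses.items():
        if not 0.0 <= low <= high <= 1.0:
            raise ValueError(f"{name}: not a sub-unitary interval")
    lows = sum(low for low, _ in interval_masses.values())
    highs = sum(high for _, high in interval_masses.values())
    # Any total between the two sums can be reached by moving points inside
    # their intervals, so 1 must lie between them.
    return lows - tol <= 1.0 <= highs + tol

# Hypothetical imprecise assignments over three propositions.
ok = is_admissible({"A": (0.1, 0.3), "B": (0.2, 0.6), "C": (0.3, 0.5)})
too_small = is_admissible({"A": (0.1, 0.2), "B": (0.1, 0.3), "C": (0.0, 0.4)})
print(ok, too_small)
```

In the second example the upper bounds add up to 0.9 only, so no choice of points inside the intervals can sum to one.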


In many fusion problems, it seems very difficult (if not impossible) to have precise sources of evidence
generating precise basic belief assignments (especially when belief functions are provided by human
experts), and a more flexible plausible and paradoxical theory supporting imprecise information becomes
necessary. This chapter proposes a new way to deal with the fusion of imprecise, uncertain and
conflicting sources of information. Section 6.2 briefly presents the DSm rule of combination for precise
belief functions. In section 6.3, we present the operations on sets that are necessary to deal with the
imprecise nature of information in our framework, so that the chapter is self-contained. In section 6.4,
we propose a method to combine simple imprecise belief assignments corresponding only to sub-unitary
intervals, also known as IBS (Interval-valued Belief Structures) in [1]. In section 6.5, we present the
generalization of our new fusion rules to combine any type of imprecise belief assignment which may be
represented by the union of several sub-unitary (half-)open intervals, (half-)closed intervals and/or
sets of points belonging to $[0, 1]$. Several numerical examples are also given. In the sequel, one uses
the notation $(a, b)$ for an open interval, $[a, b]$ for a closed interval, and $(a, b]$ or $[a, b)$ for
a half-open and half-closed interval.


6.2 Combination of precise beliefs 


6.2.1 General DSm rule of combination 


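Let's consider a frame of discernment of a fusion problem $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$,
its hyper-power set $D^\Theta$ (i.e. the set of all propositions built from the elements $\theta_i$ of
$\Theta$ with the $\cap$ and $\cup$ operators; see the earlier chapter on the hyper-power set), and $k$
independent (precise) sources of information $B_1, B_2, \ldots, B_k$ with their associated generalized
basic belief assignments (gbba) $m_1(.), m_2(.), \ldots, m_k(.)$ defined over $D^\Theta$. Let $M$ be the
mass matrix

$$M = \begin{bmatrix} m_{11} & m_{12} & \ldots & m_{1d} \\ m_{21} & m_{22} & \ldots & m_{2d} \\ \ldots & \ldots & \ldots & \ldots \\ m_{k1} & m_{k2} & \ldots & m_{kd} \end{bmatrix}$$

where $d = |D^\Theta|$ is the dimension of the hyper-power set, and $m_{ij} \in [0, 1]$, for all
$1 \le i \le k$ and $1 \le j \le d$, is the mass assigned by source $B_i$ to the element
$A_j \in D^\Theta$. We use the DSm ordering procedure presented earlier in this book for enumerating the
elements $A_1, A_2, \ldots, A_d$ of the hyper-power set $D^\Theta$. The matrix $M$ characterizes all the
available information which has to be combined to solve the fusion problem under consideration. Since
$m_1(.), m_2(.), \ldots, m_k(.)$ are gbba, the sum of each row of the matrix must be one. For any
(possibly hybrid) model $\mathcal{M}(\Theta)$, we apply the DSm general rule of combination (also called
hybrid DSm rule) for $k \ge 2$ sources to fuse the masses (see chapter 4), defined for all
$A \in D^\Theta$ as:

$$m_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A) \Big[ S_1(A) + S_2(A) + S_3(A) \Big] \qquad (6.1)$$

$\phi(A)$ is the characteristic non-emptiness function of the set $A$, i.e. $\phi(A) = 1$ if
$A \notin \boldsymbol{\emptyset}$ and $\phi(A) = 0$ otherwise.
$\boldsymbol{\emptyset} \triangleq \{\emptyset, \emptyset_{\mathcal{M}}\}$ represents the set made of the
absolutely empty element and of all relatively empty elements belonging to $D^\Theta$
(elements/propositions which have been forced to the empty set in the chosen hybrid model
$\mathcal{M}(\Theta)$). If no constraint is introduced in the model, $\boldsymbol{\emptyset}$ reduces to
$\{\emptyset\}$ and this corresponds to the free DSm model (see chapter 4). If all constraints of
exclusivity between the elements $\theta_i \in \Theta$ are introduced, the hybrid model
$\mathcal{M}(\Theta)$ corresponds to Shafer’s model, on which Dempster-Shafer Theory (DST) [9] is based.
$S_1(A)$, $S_2(A)$ and $S_3(A)$ are defined by

$$S_1(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \ \prod_{i=1}^{k} m_i(X_i) \qquad (6.2)$$

$$S_2(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [\mathcal{U} = A] \vee [(\mathcal{U} \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \ \prod_{i=1}^{k} m_i(X_i) \qquad (6.3)$$

$$S_3(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ (X_1 \cap X_2 \cap \ldots \cap X_k) \in \boldsymbol{\emptyset}}} \ \prod_{i=1}^{k} m_i(X_i) \qquad (6.4)$$

where $I_t \triangleq \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$ and
$\mathcal{U} \triangleq u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)$. $u(X)$ is the union of all singletons
$\theta_i$ that compose $X$. For example, if $X$ is a singleton then $u(X) = X$; if
$X = \theta_1 \cap \theta_2$ or $X = \theta_1 \cup \theta_2$ then $u(X) = \theta_1 \cup \theta_2$; if
$X = (\theta_1 \cap \theta_2) \cup \theta_3$ then $u(X) = \theta_1 \cup \theta_2 \cup \theta_3$, etc.; by
convention $u(\emptyset) \triangleq \emptyset$.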
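6.2.2 Examples


Let's consider at time $t$ the frame of discernment $\Theta = \{\theta_1, \theta_2, \theta_3\}$ and two
independent bodies of evidence $B_1$ and $B_2$ with the generalized basic belief assignments $m_1(.)$ and
$m_2(.)$ given by:

$$\begin{array}{c|cc} A \in D^\Theta & m_1(A) & m_2(A) \\ \hline \theta_1 & 0.1 & 0.5 \\ \theta_2 & 0.2 & 0.3 \\ \theta_3 & 0.3 & 0.1 \\ \theta_1 \cap \theta_2 & 0.4 & 0.1 \end{array}$$

Table 6.1: Inputs of the fusion with precise bba


Based on the free DSm model and the classical DSm rule (6.2), the combination denoted by the symbol
$\oplus$ (i.e. $m(.) = [m_1 \oplus m_2](.)$) of these two precise sources of evidence is

$$\begin{array}{c|c} A \in D^\Theta & m(A) = [m_1 \oplus m_2](A) \\ \hline \theta_1 & 0.05 \\ \theta_2 & 0.06 \\ \theta_3 & 0.03 \\ \theta_1 \cap \theta_2 & 0.52 \\ \theta_1 \cap \theta_3 & 0.16 \\ \theta_2 \cap \theta_3 & 0.11 \\ \theta_1 \cap \theta_2 \cap \theta_3 & 0.07 \end{array}$$

Table 6.2: Fusion with DSm classic rule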


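Then, assume that at time $t+1$ one finds out for some reason that the free DSm model has to be changed
by introducing the constraint $\theta_1 \cap \theta_2 = \emptyset$, which also involves
$\theta_1 \cap \theta_2 \cap \theta_3 = \emptyset$. This characterizes the hybrid model $\mathcal{M}$ we
have to work with. Then one uses the general hybrid DSm rule of combination for scalars (i.e. for the
precise masses $m_1(.)$ and $m_2(.)$) to get the new result of the fusion at time $t+1$. According to
(6.1), one obtains $m(\theta_1 \cap \theta_2 \overset{\mathcal{M}}{\equiv} \emptyset) = 0$,
$m(\theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset) = 0$ and

$$\begin{array}{c|l} A \in D^\Theta & m_{\mathcal{M}(\Theta)}(A) \\ \hline \theta_1 & 0.05 + [0.1(0.1) + 0.5(0.4)] = 0.26 \\ \theta_2 & 0.06 + [0.2(0.1) + 0.3(0.4)] = 0.20 \\ \theta_3 & 0.03 + [0.3(0.1) + 0.1(0.4)] = 0.10 \\ \theta_1 \cap \theta_3 & 0.16 \\ \theta_2 \cap \theta_3 & 0.11 \\ \theta_1 \cup \theta_2 & 0 + [0.13] + [0.04] = 0.17 \end{array}$$

Table 6.3: Fusion with hybrid DSm rule for model M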
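
The totals of Table 6.3 can be recomputed with a small sketch of the rule for two sources. Several points are assumptions made for illustration: propositions are encoded as sets of Venn regions of the free model, each one carrying the singletons that compose it so that $u(X)$ is available; the input masses are taken to be m1 = (0.1, 0.2, 0.3, 0.4) and m2 = (0.5, 0.3, 0.1, 0.1) on θ1, θ2, θ3, θ1 ∩ θ2, values that reproduce the totals 0.26, 0.20, 0.10, 0.16, 0.11 and 0.17; and the helper names `AND`, `OR` and `dsm_combine` are hypothetical.

```python
from itertools import combinations, product

N = 3
# Venn regions of the free model: every non-empty subset of {1, 2, 3}.
REGIONS = frozenset(
    frozenset(c) for r in range(1, N + 1) for c in combinations(range(1, N + 1), r)
)

def AND(*idx):
    """Intersection of singletons: (regions inside all of them, singletons used)."""
    return frozenset(r for r in REGIONS if set(idx) <= r), frozenset(idx)

def OR(*idx):
    """Union of singletons: (regions inside at least one, singletons used)."""
    return frozenset(r for r in REGIONS if r & set(idx)), frozenset(idx)

def dsm_combine(m1, m2, forced_empty=frozenset()):
    """Two-source sketch of the S1, S2 and S3 terms of the hybrid DSm rule."""
    allowed = REGIONS - forced_empty
    fused = {}
    for ((r1, s1), a), ((r2, s2), b) in product(m1.items(), m2.items()):
        x1, x2 = r1 & allowed, r2 & allowed
        if x1 & x2:                      # S1: non-empty intersection
            target = x1 & x2
        elif x1 or x2:                   # S3: empty intersection, mass to the union
            target = x1 | x2
        else:                            # S2: both empty, mass to u(X1) u u(X2)
            target = (OR(*(s1 | s2))[0] & allowed) or allowed
        fused[target] = fused.get(target, 0.0) + a * b
    return fused, allowed

m1 = {AND(1): 0.1, AND(2): 0.2, AND(3): 0.3, AND(1, 2): 0.4}
m2 = {AND(1): 0.5, AND(2): 0.3, AND(3): 0.1, AND(1, 2): 0.1}

free, _ = dsm_combine(m1, m2)
hybrid, allowed = dsm_combine(m1, m2, forced_empty=AND(1, 2)[0])
rows = [("theta1", AND(1)), ("theta2", AND(2)), ("theta3", AND(3)),
        ("theta1 n theta3", AND(1, 3)), ("theta2 n theta3", AND(2, 3)),
        ("theta1 u theta2", OR(1, 2))]
for name, (regions, _) in rows:
    print(name, round(hybrid[regions & allowed], 2))
```

Declaring θ1 ∩ θ2 empty simply removes the two Venn regions it covers, after which set intersection and union do the bookkeeping of the mass transfers.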
6.3 Operations on sets 


To manipulate imprecise information and for the chapter to be self-contained, we need to introduce
operations on sets as follows (detailed presentations of Interval Analysis and Methods can be found
in [3], [4], [6], [7], [8]). The interval operations defined here for imprecision are similar to the
rational interval extension through interval arithmetic [10], but they are different from Modal Interval
Analysis, which doesn’t serve our fusion needs. We are not interested in the dual of an interval
$[a, b]$, used in Modal Interval Analysis, because we always consider $a \le b$, so that its dual,
$Du([a, b]) = [b, a]$, doesn’t occur. Yet, we generalize the interval operations to any set operations.
Of course, for the fusion we only need real sub-unitary sets, but the set operations defined here can be
used for any kind of sets.


Let $S_1$ and $S_2$ be two (unidimensional) real standard subsets of the unit interval $[0, 1]$, and a
number $k \in [0, 1]$; then one defines

• Addition of sets

$$S_1 \boxplus S_2 = S_2 \boxplus S_1 \triangleq \{x \mid x = s_1 + s_2,\ s_1 \in S_1,\ s_2 \in S_2\} \quad \text{with} \quad \begin{cases} \inf(S_1 \boxplus S_2) = \inf(S_1) + \inf(S_2) \\ \sup(S_1 \boxplus S_2) = \sup(S_1) + \sup(S_2) \end{cases}$$

and, as a particular case, we have

$$\{k\} \boxplus S_2 = S_2 \boxplus \{k\} = \{x \mid x = k + s_2,\ s_2 \in S_2\} \quad \text{with} \quad \begin{cases} \inf(\{k\} \boxplus S_2) = k + \inf(S_2) \\ \sup(\{k\} \boxplus S_2) = k + \sup(S_2) \end{cases}$$

Examples:

$[0.1, 0.3] \boxplus [0.2, 0.5] = [0.3, 0.8]$ because $0.1 + 0.2 = 0.3$ and $0.3 + 0.5 = 0.8$;
$(0.1, 0.3] \boxplus [0.2, 0.5] = (0.3, 0.8]$;
$(0.1, 0.3] \boxplus (0.2, 0.5] = (0.3, 0.8]$;
$[0.1, 0.3) \boxplus [0.2, 0.5] = [0.3, 0.8)$;
$[0.1, 0.3] \boxplus [0.2, 0.5) = [0.3, 0.8)$;
$(0.1, 0.3] \boxplus (0.2, 0.5) = (0.3, 0.8)$;
$[0.7, 0.8] \boxplus [0.5, 0.9] = [1.2, 1.7]$;
$\{0.4\} \boxplus [0.2, 0.5] = [0.2, 0.5] \boxplus \{0.4\} = [0.6, 0.9]$ because $0.4 + 0.2 = 0.6$ and $0.4 + 0.5 = 0.9$;
$\{0.4\} \boxplus (0.2, 0.5] = (0.6, 0.9]$;
$\{0.4\} \boxplus [0.2, 0.5) = [0.6, 0.9)$;
$\{0.4\} \boxplus (0.2, 0.5) = (0.6, 0.9)$.


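• Subtraction of sets

$$S_1 \boxminus S_2 \triangleq \{x \mid x = s_1 - s_2,\ s_1 \in S_1,\ s_2 \in S_2\} \quad \text{with} \quad \begin{cases} \inf(S_1 \boxminus S_2) = \inf(S_1) - \sup(S_2) \\ \sup(S_1 \boxminus S_2) = \sup(S_1) - \inf(S_2) \end{cases}$$

and, as a particular case, we have

$$\{k\} \boxminus S_2 = \{x \mid x = k - s_2,\ s_2 \in S_2\} \quad \text{with} \quad \begin{cases} \inf(\{k\} \boxminus S_2) = k - \sup(S_2) \\ \sup(\{k\} \boxminus S_2) = k - \inf(S_2) \end{cases}$$

and similarly for $S_2 \boxminus \{k\}$ with

$$\begin{cases} \inf(S_2 \boxminus \{k\}) = \inf(S_2) - k \\ \sup(S_2 \boxminus \{k\}) = \sup(S_2) - k \end{cases}$$

Examples:

$[0.3, 0.7] \boxminus [0.2, 0.3] = [0.0, 0.5]$ because $0.3 - 0.3 = 0.0$ and $0.7 - 0.2 = 0.5$;
$[0.3, 0.7] \boxminus \{0.1\} = [0.2, 0.6]$;
$\{0.8\} \boxminus [0.3, 0.7] = [0.1, 0.5]$ because $0.8 - 0.7 = 0.1$ and $0.8 - 0.3 = 0.5$;
$[0.1, 0.8] \boxminus [0.5, 0.6] = [-0.5, 0.3]$;
$[0.1, 0.8] \boxminus [0.2, 0.9] = [-0.8, 0.6]$;
$[0.2, 0.5] \boxminus [0.1, 0.6] = [-0.4, 0.4]$.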





e Multiplication of sets 





inf (1 : Sa) = inf (5) j inf (S2) 














S1 E So = {x | £ = s1 < S2, S1 E S1, S2 € So} with 























sup(S1 © S2) = sup( S1) - sup(S2) 


and, as a particular case, we have 





inf({k} E S2) = k- inf (S2) 

















{k} J] S2 = Sa E {k} = {x | x = k- 82,82 € So} with 























sup({k} © 92) = k - sup(52) 
Examples:

$[0.1, 0.6] \boxdot [0.8, 0.9] = [0.08, 0.54]$ because $0.1 \cdot 0.8 = 0.08$ and $0.6 \cdot 0.9 = 0.54$;
$[0.1, 0.6] \boxdot \{0.3\} = \{0.3\} \boxdot [0.1, 0.6] = [0.03, 0.18]$ because $0.3 \cdot 0.1 = 0.03$ and $0.3 \cdot 0.6 = 0.18$.

• Division of sets

In our fusion context, the division of sets is not necessary since the DSm rules of combination (classic or hybrid ones) do not require a normalization procedure and thus a division operation. Actually, the DSm rules require only addition and multiplication operations. We however give here the definition of division of sets only for the reader's interest and curiosity. The division of sets is defined as follows:


If $0 \notin S_2$, then $S_1 \boxslash S_2 \triangleq \{x \mid x = s_1/s_2,\ s_1 \in S_1,\ s_2 \in S_2\}$ with

$$\begin{cases} \inf(S_1 \boxslash S_2) = \inf(S_1)/\sup(S_2) \\ \sup(S_1 \boxslash S_2) = \sup(S_1)/\inf(S_2) & \text{if } 0 \notin S_2 \\ \sup(S_1 \boxslash S_2) = +\infty & \text{if } 0 \in S_2 \end{cases}$$

If $0 \in S_2$, then $S_1 \boxslash S_2 = [\inf(S_1)/\sup(S_2), +\infty)$

and as some particular cases, we have for $k \neq 0$,

$$\{k\} \boxslash S_2 = \{x \mid x = k/s_2, \text{ where } s_2 \in S_2 \setminus \{0\}\} \quad\text{with}\quad \begin{cases} \inf(\{k\} \boxslash S_2) = k/\sup(S_2) \\ \sup(\{k\} \boxslash S_2) = k/\inf(S_2) \end{cases}$$

and if $0 \in S_2$ then $\sup(\{k\} \boxslash S_2) = +\infty$.

One has also as some particular case for $k \neq 0$,

$$S_2 \boxslash \{k\} = \{x \mid x = s_2/k, \text{ where } s_2 \in S_2\} \quad\text{with}\quad \begin{cases} \inf(S_2 \boxslash \{k\}) = \inf(S_2)/k \\ \sup(S_2 \boxslash \{k\}) = \sup(S_2)/k \end{cases}$$

Examples:

$[0.4, 0.6] \boxslash [0.1, 0.2] = [2, 6]$ because $0.4/0.2 = 2$ and $0.6/0.1 = 6$;
$[0.4, 0.6] \boxslash \{0.4\} = [1, 1.5]$ because $0.4/0.4 = 1$ and $0.6/0.4 = 1.5$;
$\{0.8\} \boxslash [0.2, 0.5] = [1.6, 4]$ because $0.8/0.2 = 4$ and $0.8/0.5 = 1.6$;
$[0, 0.5] \boxslash [0.1, 0.2] = [0, 5]$;
$[0, 0.5] \boxslash \{0.4\} = [0, 1.25]$ because $0/0.4 = 0$ and $0.5/0.4 = 1.25$;
$[0.3, 0.9] \boxslash [0, 0.2] = [1.5, +\infty)$ because $0.3/0.2 = 1.5$ and since $0 \in (S_2 = [0, 0.2])$, $\sup([0.3, 0.9] \boxslash [0, 0.2]) = +\infty$;
$[0, 0.9] \boxslash [0, 0.2] = [0, +\infty)$;
$\{0.7\} \boxslash [0, 0.2] = [3.5, +\infty)$ because $0.7/0.2 = 3.5$ and $0 \in (S_2 = [0, 0.2])$, $\sup(\{0.7\} \boxslash [0, 0.2]) = +\infty$;
$\{0\} \boxslash [0, 0.2] = [0, +\infty)$;
$[0.3, 0.9] \boxslash \{0\} = +\infty$;
$[0, 0.9] \boxslash \{0\} = +\infty$;
$[0.2, 0.7] \boxslash [0, 0.8] = [0.25, +\infty)$.
These operations can be directly extended for any types of sets (not necessarily sub-unitary subsets as it will be shown in our general examples of section 6), but for simplicity, we will start the presentation in the following section only for sub-unitary subsets.

Due to the fact that the fusion of imprecise information must also be included in the unit interval $[0, 1]$ as it happens with the fusion of precise information, if the masses computed are less than 0 one replaces them by 0, and similarly if they are greater than 1 one replaces them by 1. For example (specifically in our fusion context): $[0.2, 0.4] \boxplus [0.5, 0.8] = [0.7, 1.2]$ will be forced to $[0.7, 1]$.
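The addition, subtraction and multiplication of sets above, together with the forcing of results into $[0, 1]$, can be sketched in a few lines of Python. This is only an illustrative sketch under simplifying assumptions: the class name `Interval` and its methods are hypothetical, only closed intervals with non-negative bounds are handled (open or half-open ends are not tracked), and a scalar $k$ is represented as the degenerate interval $[k, k]$.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """Closed interval [lo, hi]; open/half-open ends are NOT tracked (assumption)."""
    lo: float
    hi: float

    def __add__(self, other):
        # inf(S1 + S2) = inf(S1) + inf(S2), sup(S1 + S2) = sup(S1) + sup(S2)
        return Interval(self.lo + other.lo, self.hi + other.hi)

    def __sub__(self, other):
        # inf(S1 - S2) = inf(S1) - sup(S2), sup(S1 - S2) = sup(S1) - inf(S2)
        return Interval(self.lo - other.hi, self.hi - other.lo)

    def __mul__(self, other):
        # Valid for non-negative bounds only, which is the case for masses.
        return Interval(self.lo * other.lo, self.hi * other.hi)

    def clamp(self):
        # Force the result back into the unit interval [0, 1].
        return Interval(min(max(self.lo, 0.0), 1.0), min(max(self.hi, 0.0), 1.0))


def point(k):
    """A scalar k seen as the degenerate interval [k, k]."""
    return Interval(k, k)


total = Interval(0.2, 0.4) + Interval(0.5, 0.8)   # about [0.7, 1.2]
forced = total.clamp()                            # about [0.7, 1]
diff = Interval(0.3, 0.7) - Interval(0.2, 0.3)    # about [0.0, 0.5]
shifted = point(0.4) + Interval(0.2, 0.5)         # about [0.6, 0.9]
```

Floating-point rounding means the computed bounds match the values in the text only up to a small tolerance.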














6.4 Fusion of beliefs defined on single sub-unitary intervals 


6.4.1 DSm rules of combination 


Let's now consider some given sources of information which are not able to provide us a specific/precise mass $m_{ij} \in [0, 1]$, but only an interval centered$^1$ in $m_{ij}$, i.e. $I_{ij} = [m_{ij} - \epsilon_{ij}, m_{ij} + \epsilon_{ij}]$ where $0 \le \epsilon_{ij} \le 1$ and $I_{ij} \subseteq [0, 1]$ for all $1 \le i \le k$ and $1 \le j \le d$. The cases when $I_{ij}$ are half-closed or open are similarly treated.

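Lemma 1: if $A, B \subseteq [0, 1]$ and $a \in [0, 1]$ then:

$$\begin{array}{ll} \inf(A \boxdot B) = \inf(A) \cdot \inf(B) & \inf(A \boxplus B) = \inf(A) + \inf(B) \\ \sup(A \boxdot B) = \sup(A) \cdot \sup(B) & \sup(A \boxplus B) = \sup(A) + \sup(B) \\ \inf(a \cdot A) = a \cdot \inf(A) & \inf(a + A) = a + \inf(A) \\ \sup(a \cdot A) = a \cdot \sup(A) & \sup(a + A) = a + \sup(A) \end{array}$$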


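We can regard a scalar $a$ as a particular interval $[a, a]$, thus all operations of the previous lemma are reduced to multiplications and additions of sub-unitary intervals. Therefore, the DSm general rule (6-1), which operates (multiplies and adds) sub-unitary scalars, can be extended to operate sub-unitary intervals. The formula remains the same, but $m_i(X_i)$, $1 \le i \le k$, are sub-unitary intervals $I_{ij}$. The mass matrix $M$ is extended to:

$$\inf(M) = \begin{bmatrix} m_{11} - \epsilon_{11} & m_{12} - \epsilon_{12} & \ldots & m_{1d} - \epsilon_{1d} \\ m_{21} - \epsilon_{21} & m_{22} - \epsilon_{22} & \ldots & m_{2d} - \epsilon_{2d} \\ \vdots & \vdots & & \vdots \\ m_{k1} - \epsilon_{k1} & m_{k2} - \epsilon_{k2} & \ldots & m_{kd} - \epsilon_{kd} \end{bmatrix}$$

$$\sup(M) = \begin{bmatrix} m_{11} + \epsilon_{11} & m_{12} + \epsilon_{12} & \ldots & m_{1d} + \epsilon_{1d} \\ m_{21} + \epsilon_{21} & m_{22} + \epsilon_{22} & \ldots & m_{2d} + \epsilon_{2d} \\ \vdots & \vdots & & \vdots \\ m_{k1} + \epsilon_{k1} & m_{k2} + \epsilon_{k2} & \ldots & m_{kd} + \epsilon_{kd} \end{bmatrix}$$

$^1$ This interval centered assumption is not important actually but has been adopted here only for notational convenience.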








Notations: Let's distinguish between the DSm general rule for scalars, noted as usual $m_{\mathcal{M}(\Theta)}(A)$, or $m_i(X_j)$, etc., and the DSm general rule for intervals, noted as $m^I_{\mathcal{M}(\Theta)}(A)$, or $m^I_i(X_j)$, etc. Hence, the DSm general rule for interval-valued masses is:


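$$\inf(m^I_{\mathcal{M}(\Theta)}(A)) \triangleq \phi(A) \left[ S_1^{\inf}(A) + S_2^{\inf}(A) + S_3^{\inf}(A) \right] \tag{6.5}$$

with

$$S_1^{\inf}(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \ \prod_{i=1}^{k} \inf(m^I_i(X_i))$$

$$S_2^{\inf}(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [\mathcal{U} = A] \vee [(\mathcal{U} \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \ \prod_{i=1}^{k} \inf(m^I_i(X_i))$$

$$S_3^{\inf}(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ (X_1 \cap X_2 \cap \ldots \cap X_k) \in \boldsymbol{\emptyset}}} \ \prod_{i=1}^{k} \inf(m^I_i(X_i))$$

and

$$\sup(m^I_{\mathcal{M}(\Theta)}(A)) \triangleq \phi(A) \left[ S_1^{\sup}(A) + S_2^{\sup}(A) + S_3^{\sup}(A) \right] \tag{6.6}$$

with

$$S_1^{\sup}(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \ \prod_{i=1}^{k} \sup(m^I_i(X_i))$$

$$S_2^{\sup}(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [\mathcal{U} = A] \vee [(\mathcal{U} \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \ \prod_{i=1}^{k} \sup(m^I_i(X_i))$$

$$S_3^{\sup}(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ (X_1 \cap X_2 \cap \ldots \cap X_k) \in \boldsymbol{\emptyset}}} \ \prod_{i=1}^{k} \sup(m^I_i(X_i))$$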


Actually formula (6.5) results from applying the hybrid DSm rule for scalars to the matrix $\inf(M)$, while formula (6.6) results from applying the hybrid DSm rule for scalars to the matrix $\sup(M)$. The bounds of the DSm classic rule for the free DSm model are given for all $A \in D^\Theta$ by $S_1^{\inf}(A)$ and $S_1^{\sup}(A)$. Combining (6.5) and (6.6), one gets directly:

$$m^I_{\mathcal{M}(\Theta)}(A) = \left[ \inf m^I_{\mathcal{M}(\Theta)}(A),\ \sup m^I_{\mathcal{M}(\Theta)}(A) \right] \tag{6.7}$$

Of course, whether this interval is closed to the left and/or to the right depends on whether the combined intervals $I_{ij}$ are closed. If all of them are closed to the left, then $m^I_{\mathcal{M}(\Theta)}(A)$ is also closed to the left. But, if at least one is open to the left, then $m^I_{\mathcal{M}(\Theta)}(A)$ is open to the left. Similarly for the right end. Because one has $\forall i = 1, \ldots, k$ and $\forall j = 1, \ldots, d$:

$$\lim_{\epsilon_{ij} \to 0} (\inf(M)) = \lim_{\epsilon_{ij} \to 0} (\sup(M)) = M \tag{6.8}$$

the following theorem results.
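Theorem 1: $\forall A \in D^\Theta$, $\forall i = 1, \ldots, k$ and $\forall j = 1, \ldots, d$, one has:

$$\lim_{\epsilon_{ij} \to 0} m^I_{\mathcal{M}(\Theta)}(A) = \left[ \lim\nolimits_{\inf_{ij}}(A),\ \lim\nolimits_{\sup_{ij}}(A) \right] \quad\text{with}\quad \begin{cases} \lim_{\inf_{ij}}(A) \triangleq \lim_{\epsilon_{ij} \to 0} (\inf(m^I_{\mathcal{M}(\Theta)}(A))) \\ \lim_{\sup_{ij}}(A) \triangleq \lim_{\epsilon_{ij} \to 0} (\sup(m^I_{\mathcal{M}(\Theta)}(A))) \end{cases} \tag{6.9}$$

In other words, if all centered sub-unitary intervals converge to their corresponding mid points (the imprecision becomes zero), then the DSm rule for intervals converges towards the DSm rule for scalars.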


Normally we must apply the DSm classical or hybrid rules directly to the interval-valued masses, but this is equivalent to applying the DSm rules to the inferior and superior bounds of each mass. If, after fusion, the sum of inferior masses is < 1 (which occurs all the time because combining incomplete masses one gets incomplete results) and the sum of superior masses is > 1 (which occurs all the time because combining paraconsistent masses one gets paraconsistent results), then there exist points in each resulted interval-valued mass such that their sum is 1 (according to a continuity theorem - see section 6.5.2).


6.4.2 Example with the DSm classic rule 


Let's take back the previous example (see section 6.2.2), but let's now suppose the sources of information give at time $t$ imprecise generalized basic belief assignments, i.e. interval-valued masses centered in the scalars given in section 6.2.2 of various radii according to table 6.4.

$A$ | $m_1^I(A)$ | $m_2^I(A)$
$\theta_1$ | $[0.05, 0.15]$ | $[0.4, 0.6]$
$\theta_2$ | $[0.1, 0.3]$ | $[0.1, 0.5]$
$\theta_3$ | $[0.15, 0.45]$ | $[0, 0.2]$
$\theta_1 \cap \theta_2$ | $[0.2, 0.6]$ | $[0.05, 0.15]$

Table 6.4: Inputs of the fusion with imprecise bba

Based on the free DSm model and the classical DSm rule applied to imprecise basic belief assignments following the method proposed in previous section, one has:

$$m^I(\theta_1) = [0.05, 0.15] \boxdot [0.4, 0.6] = [0.020, 0.090]$$

$$m^I(\theta_2) = [0.1, 0.3] \boxdot [0.1, 0.5] = [0.010, 0.150]$$

$$m^I(\theta_3) = [0.15, 0.45] \boxdot [0, 0.2] = [0, 0.090]$$


















































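$$m^I(\theta_1 \cap \theta_3) = [[0.05, 0.15] \boxdot [0, 0.2]] \boxplus [[0.4, 0.6] \boxdot [0.15, 0.45]] = [0, 0.030] \boxplus [0.060, 0.270] = [0.060, 0.300]$$

$$m^I(\theta_2 \cap \theta_3) = [[0.1, 0.3] \boxdot [0, 0.2]] \boxplus [[0.1, 0.5] \boxdot [0.15, 0.45]] = [0, 0.06] \boxplus [0.015, 0.225] = [0.015, 0.285]$$

$$\begin{aligned} m^I(\theta_1 \cap \theta_2 \cap \theta_3) &= [[0.15, 0.45] \boxdot [0.05, 0.15]] \boxplus [[0, 0.2] \boxdot [0.2, 0.6]] \\ &= [0.0075, 0.0675] \boxplus [0, 0.12] \\ &= [0.0075, 0.1875] \end{aligned}$$

$$\begin{aligned} m^I(\theta_1 \cap \theta_2) &= [[0.2, 0.6] \boxdot [0.05, 0.15]] \boxplus [[0.05, 0.15] \boxdot [0.05, 0.15]] \boxplus [[0.4, 0.6] \boxdot [0.2, 0.6]] \\ &\quad \boxplus [[0.1, 0.3] \boxdot [0.05, 0.15]] \boxplus [[0.1, 0.5] \boxdot [0.2, 0.6]] \\ &\quad \boxplus [[0.05, 0.15] \boxdot [0.1, 0.5]] \boxplus [[0.4, 0.6] \boxdot [0.1, 0.3]] \\ &= [0.010, 0.090] \boxplus [0.0025, 0.0225] \boxplus [0.08, 0.36] \boxplus [0.005, 0.045] \\ &\quad \boxplus [0.02, 0.30] \boxplus [0.005, 0.075] \boxplus [0.04, 0.18] \\ &= [0.1625, 1.0725] \equiv [0.1625, 1] \end{aligned}$$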

















The last equality comes from the absorption of $[0.1625, 1.0725]$ into $[0.1625, 1]$ according to operations on sets defined in this fusion context. Thus, the final result of combination $m^I(.) = [m_1^I \oplus m_2^I](.)$ of these two imprecise sources of evidence is given in table 6.5.

$A$ | $m^I(A) = [m_1^I \oplus m_2^I](A)$
$\theta_1$ | $[0.020, 0.090]$
$\theta_2$ | $[0.010, 0.150]$
$\theta_3$ | $[0, 0.090]$
$\theta_1 \cap \theta_2$ | $[0.1625, 1.0725 \to 1]$
$\theta_1 \cap \theta_3$ | $[0.060, 0.300]$
$\theta_2 \cap \theta_3$ | $[0.015, 0.285]$
$\theta_1 \cap \theta_2 \cap \theta_3$ | $[0.0075, 0.1875]$

Table 6.5: Fusion with DSm classic rule for free DSm model


There exist some points, for example 0.03, 0.10, 0.07, 0.4, 0.1, 0.2, 0.1 from the intervals $[0.020, 0.090]$, ..., $[0.0075, 0.1875]$ respectively such that their sum is 1 and therefore the admissibility of the fusion result holds. Note that this fusion process is equivalent to using the DSm classic rule for scalars for inferior limit and incomplete information (see table 6.6), and the same rule for superior limit and paraconsistent information (see table 6.7).

$A$ | $m^{\inf}(A)$
$\theta_1$ | 0.020
$\theta_2$ | 0.010
$\theta_3$ | 0
$\theta_1 \cap \theta_2$ | 0.1625
$\theta_1 \cap \theta_3$ | 0.060
$\theta_2 \cap \theta_3$ | 0.015
$\theta_1 \cap \theta_2 \cap \theta_3$ | 0.0075

Table 6.6: Fusion with DSm classic rule on lower bounds

$A$ | $m^{\sup}(A)$
$\theta_1$ | 0.090
$\theta_2$ | 0.150
$\theta_3$ | 0.090
$\theta_1 \cap \theta_2$ | $1.0725 \to 1$
$\theta_1 \cap \theta_3$ | 0.300
$\theta_2 \cap \theta_3$ | 0.285
$\theta_1 \cap \theta_2 \cap \theta_3$ | 0.1875

Table 6.7: Fusion with DSm classic rule on upper bounds
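The equivalence between fusing intervals and fusing their lower and upper bounds separately can be checked numerically. The following Python sketch is an illustration only, not the authors' implementation. It assumes the free DSm model and handles only propositions that are pure intersections of elements of $\Theta$ (which is enough for this example): a proposition is encoded as a frozenset of indices, so that `{1, 2}` stands for $\theta_1 \cap \theta_2$, and the intersection of two such propositions is the union of their index sets. All function and variable names are hypothetical.

```python
from itertools import product

# Encoding (assumption): frozenset({1, 2}) stands for theta1 ∩ theta2.
T1, T2, T3 = frozenset({1}), frozenset({2}), frozenset({3})
T12 = frozenset({1, 2})

# Interval-valued masses of table 6.4 as (lower, upper) pairs.
m1 = {T1: (0.05, 0.15), T2: (0.1, 0.3), T3: (0.15, 0.45), T12: (0.2, 0.6)}
m2 = {T1: (0.4, 0.6), T2: (0.1, 0.5), T3: (0.0, 0.2), T12: (0.05, 0.15)}


def classic_dsm_scalar(ma, mb):
    """Classic DSm rule for two scalar bbas on intersection-only propositions."""
    fused = {}
    for (x, mx), (y, my) in product(ma.items(), mb.items()):
        key = x | y  # intersection of propositions = union of index sets
        fused[key] = fused.get(key, 0.0) + mx * my
    return fused


def classic_dsm_interval(ma, mb):
    """Apply the scalar rule to lower and upper bounds, then force into [0, 1]."""
    lower = classic_dsm_scalar({k: v[0] for k, v in ma.items()},
                               {k: v[0] for k, v in mb.items()})
    upper = classic_dsm_scalar({k: v[1] for k, v in ma.items()},
                               {k: v[1] for k, v in mb.items()})
    return {k: (lower[k], min(upper[k], 1.0)) for k in lower}


fused = classic_dsm_interval(m1, m2)
for proposition in sorted(fused, key=sorted):
    print(sorted(proposition), fused[proposition])
```

Up to floating-point rounding, the printed bounds reproduce table 6.5, including the upper bound of $\theta_1 \cap \theta_2$ being forced from 1.0725 to 1.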


6.4.3 Example with the hybrid DSm rule 


Then, assume at time $t + 1$, that one finds out for some reason that the free DSm model has to be changed by introducing the constraint $\theta_1 \cap \theta_2 = \emptyset$ which involves also $\theta_1 \cap \theta_2 \cap \theta_3 = \emptyset$. One directly applies the hybrid DSm rule for sets to get the new belief masses:


























$$\begin{aligned} m^I(\theta_1) &= [0.020, 0.090] \boxplus [[0.05, 0.15] \boxdot [0.05, 0.15]] \boxplus [[0.4, 0.6] \boxdot [0.2, 0.6]] \\ &= [0.020, 0.090] \boxplus [0.0025, 0.0225] \boxplus [0.08, 0.36] = [0.1025, 0.4725] \end{aligned}$$

$$\begin{aligned} m^I(\theta_2) &= [0.010, 0.150] \boxplus [[0.1, 0.3] \boxdot [0.05, 0.15]] \boxplus [[0.1, 0.5] \boxdot [0.2, 0.6]] \\ &= [0.010, 0.150] \boxplus [0.005, 0.045] \boxplus [0.02, 0.30] = [0.035, 0.495] \end{aligned}$$

$$\begin{aligned} m^I(\theta_3) &= [0, 0.090] \boxplus [[0.15, 0.45] \boxdot [0.05, 0.15]] \boxplus [[0, 0.2] \boxdot [0.2, 0.6]] \\ &= [0, 0.090] \boxplus [0.0075, 0.0675] \boxplus [0, 0.12] = [0.0075, 0.2775] \end{aligned}$$

$$\begin{aligned} m^I(\theta_1 \cup \theta_2) &= [[0.2, 0.6] \boxdot [0.05, 0.15]] \boxplus [[0.05, 0.15] \boxdot [0.1, 0.5]] \boxplus [[0.4, 0.6] \boxdot [0.1, 0.3]] \\ &= [0.010, 0.090] \boxplus [0.005, 0.075] \boxplus [0.04, 0.18] = [0.055, 0.345] \end{aligned}$$











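$m^I(\theta_1 \cap \theta_2) = m^I(\theta_1 \cap \theta_2 \cap \theta_3) = 0$ by definition of empty masses (due to the choice of the hybrid model $\mathcal{M}$). $m^I(\theta_1 \cap \theta_3) = [0.060, 0.300]$ and $m^I(\theta_2 \cap \theta_3) = [0.015, 0.285]$ remain the same. Finally, the result of the fusion of imprecise belief assignments for the chosen hybrid model $\mathcal{M}$ is summarized in table 6.8.

$A$ | $m^I(A) = [m^{\inf}(A), m^{\sup}(A)]$
$\theta_1$ | $[0.1025, 0.4725]$
$\theta_2$ | $[0.035, 0.495]$
$\theta_3$ | $[0.0075, 0.2775]$
$\theta_1 \cap \theta_2 \overset{\mathcal{M}}{\equiv} \emptyset$ | $[0, 0] = 0$
$\theta_1 \cap \theta_3$ | $[0.060, 0.300]$
$\theta_2 \cap \theta_3$ | $[0.015, 0.285]$
$\theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$ | $[0, 0] = 0$
$\theta_1 \cup \theta_2$ | $[0.055, 0.345]$

Table 6.8: Fusion with hybrid DSm rule for model $\mathcal{M}$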


The admissibility of the fusion result still holds since there exist some points, for example 0.2, 0.2, 0.1, 0, 0.2, 0.1, 0, 0.2 from the intervals $[0.1025, 0.4725]$, ..., $[0.055, 0.345]$ respectively such that their sum is 1. Actually in each of these examples there are infinitely many such groups of points in each respective interval whose sum is 1. This can be generalized for any examples.
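One simple way to exhibit such a group of points, sketched below in Python, is to move every interval from its lower bound towards its upper bound by the same fraction $t$, chosen so that the total reaches 1. This is only an illustration of the continuity argument under the assumption that all intervals are closed; the function name `admissible_points` is hypothetical and this is not a procedure given in the text.

```python
def admissible_points(intervals):
    """Return one point per closed interval such that the points sum to 1,
    or None when no such selection exists.

    For closed intervals a selection exists exactly when
    sum(lower bounds) <= 1 <= sum(upper bounds).
    """
    lo_sum = sum(lo for lo, _ in intervals)
    hi_sum = sum(hi for _, hi in intervals)
    if not (lo_sum <= 1.0 <= hi_sum):
        return None
    if hi_sum == lo_sum:
        # All intervals are degenerate and already sum to 1.
        return [lo for lo, _ in intervals]
    # Common interpolation fraction t in [0, 1].
    t = (1.0 - lo_sum) / (hi_sum - lo_sum)
    return [lo + t * (hi - lo) for lo, hi in intervals]


# Interval-valued masses of table 6.8 (hybrid DSm rule).
table_6_8 = [
    (0.1025, 0.4725), (0.035, 0.495), (0.0075, 0.2775), (0.0, 0.0),
    (0.060, 0.300), (0.015, 0.285), (0.0, 0.0), (0.055, 0.345),
]
points = admissible_points(table_6_8)
```

Any other selection inside the intervals that sums to 1 is equally valid; the interpolation merely guarantees that one exists whenever the two bound sums bracket 1.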


6.5 Generalization of DSm rules for sets 


In this section, we extend the previous results on the fusion of admissible imprecise information defined only on single sub-unitary intervals to the general case where the imprecision is defined on sets. In other words, in the previous section we dealt with admissible imprecise masses having the form $m^I(A) = [a, b] \subseteq [0, 1]$, and now we deal with admissible imprecise masses having the form $m^I(A) = [a_1, b_1] \cup \ldots \cup [a_m, b_m] \cup (c_1, d_1) \cup \ldots \cup (c_n, d_n) \cup (e_1, f_1] \cup \ldots \cup (e_p, f_p] \cup [g_1, h_1) \cup \ldots \cup [g_q, h_q) \cup \{A_1, \ldots, A_r\}$ where all the bounds or elements involved in $m^I(A)$ belong to $[0, 1]$.


6.5.1 General DSm rules for imprecise beliefs 
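From our previous results, one can generalize the DSm classic rule from scalars to sets in the following way: $\forall A \neq \emptyset \in D^\Theta$,

$$m^I(A) = \underset{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}}{\boxed{\sum}} \ \underset{i = 1, \ldots, k}{\boxed{\prod}} \ m_i^I(X_i) \tag{6.10}$$

where $\boxed{\sum}$ and $\boxed{\prod}$ represent the summation, and respectively product, of sets.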


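Similarly, one can generalize the hybrid DSm rule from scalars to sets in the following way:

$$m^I_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A) \boxdot \left[ S_1^I(A) \boxplus S_2^I(A) \boxplus S_3^I(A) \right] \tag{6.11}$$

$\phi(A)$ is the characteristic non emptiness function of the set $A$ and $S_1^I(A)$, $S_2^I(A)$ and $S_3^I(A)$ are defined by

$$S_1^I(A) \triangleq \underset{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}}{\boxed{\sum}} \ \underset{i = 1, \ldots, k}{\boxed{\prod}} \ m_i^I(X_i) \tag{6.12}$$

$$S_2^I(A) \triangleq \underset{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [\mathcal{U} = A] \vee [(\mathcal{U} \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}}{\boxed{\sum}} \ \underset{i = 1, \ldots, k}{\boxed{\prod}} \ m_i^I(X_i) \tag{6.13}$$

$$S_3^I(A) \triangleq \underset{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ (X_1 \cap X_2 \cap \ldots \cap X_k) \in \boldsymbol{\emptyset}}}{\boxed{\sum}} \ \underset{i = 1, \ldots, k}{\boxed{\prod}} \ m_i^I(X_i) \tag{6.14}$$

In the case when all sets are reduced to points (numbers), the set operations become normal operations with numbers; the sets operations are generalizations of numerical operations.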


6.5.2 Some lemmas and a theorem 


Lemma 2: Let the scalars $a, b \ge 0$ and the intervals $I_1, I_2 \subseteq [0, 1]$, with $a \in I_1$ and $b \in I_2$. Then obviously $(a + b) \in I_1 \boxplus I_2$ and $(a \cdot b) \in I_1 \boxdot I_2$.

Because in DSm rules of combining imprecise information, one uses only additions and multiplications of sets, according to this lemma if one takes at random a point of each mass set and one combines them using the DSm rules for scalars, the resulting point will belong to the resulting set from the fusion of mass sets using the DSm rules for sets.


Lemma 3: Let $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$ and $K \ge 2$ independent sources of information, and $d = \dim(D^\Theta)$. By combination of incomplete information in DSmT, one gets incomplete information.

Proof: Suppose the masses of the sources of information on $D^\Theta$ are, for all $1 \le j \le K$, represented by the mass-vector $m_j = [m_{j_1}, m_{j_2}, \ldots, m_{j_d}]$ with $0 \le \sum_{r=1}^{d} m_{j_r} < 1$. According to the DSm network architecture, no matter what DSm rule of combination is applied (classic or hybrid), the sum of all resulted masses has the form:

$$\prod_{j=1}^{K} (m_{j_1} + m_{j_2} + \ldots + m_{j_d}) < \underbrace{(1 \times 1 \times \ldots \times 1)}_{K \text{ times}} = 1 \tag{6.15}$$

Lemma 4: By combination of paraconsistent information, one gets paraconsistent information.

Proof: Using the same notations and similar reasoning, one has for all $1 \le j \le K$, $m_j = [m_{j_1}, m_{j_2}, \ldots, m_{j_d}]$, with $\sum_{r=1}^{d} m_{j_r} > 1$. Then

$$\prod_{j=1}^{K} (m_{j_1} + m_{j_2} + \ldots + m_{j_d}) > \underbrace{(1 \times 1 \times \ldots \times 1)}_{K \text{ times}} = 1$$
Lemma 5: Combining incomplete (sum of masses < 1) with complete (sum of masses = 1) information, one gets incomplete information.

Lemma 6: Combining complete information, one gets complete information.

Remark: Combining incomplete with paraconsistent (sum of masses > 1) information can give any result. For example:

• If the sum of masses of the first source is 0.99 (incomplete) and the sum of masses of the second source is 1.01 (paraconsistent), then the sum of resulted masses is 0.99 × 1.01 = 0.9999 (i.e. incomplete).

• But if the first is 0.9 (incomplete) and the second is 1.2 (paraconsistent), then the resulted sum of masses is 0.9 × 1.2 = 1.08 (i.e. paraconsistent).

We can also have incomplete information fused with paraconsistent information and get complete information. For example: 0.8 × 1.25 = 1.
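The product form used in the proofs of Lemmas 3 and 4 can be illustrated numerically: since every product of one mass per source is assigned to exactly one element by the DSm rules, the total fused mass equals the product of the per-source totals. The Python sketch below checks this for two hypothetical mass vectors (the numbers and the function name `total_fused_mass` are illustrative assumptions, not taken from the text).

```python
from itertools import product
from math import prod


def total_fused_mass(sources):
    """Sum, over every choice of one focal element per source,
    of the product of the chosen masses."""
    return sum(prod(combo) for combo in product(*sources))


incomplete = [0.5, 0.3, 0.1]       # sums to 0.9 (< 1)
paraconsistent = [0.6, 0.4, 0.2]   # sums to 1.2 (> 1)

total = total_fused_mass([incomplete, paraconsistent])
expected = sum(incomplete) * sum(paraconsistent)   # 0.9 x 1.2, about 1.08
```

Here `total` and `expected` agree up to floating-point rounding, which matches the second bullet of the remark above.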
Admissibility condition:

An imprecise mass on $D^\Theta$ is considered admissible if there exists at least one point belonging to $[0, 1]$ in each mass set such that the sum of these points is equal to 1 (i.e. complete information for at least a group of selected points).

Remark: A complete scalar information is admissible. Of course, for the incomplete scalar information and paraconsistent scalar information there cannot be an admissibility condition, because by definition the masses of these two types of information do not add up to 1 (i.e. to the complete information).

Theorem of Admissibility:

Let a frame $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$, with $n \ge 2$, its hyper-power set $D^\Theta$ with $\dim(D^\Theta) = d$, and $K \ge 2$ sources of information providing imprecise admissible masses on $D^\Theta$. Then, the resulted mass, after fusion of the imprecise masses of these sources of information with the DSm rules of combination, is also admissible.


Proof: Let $s_j$, $1 \le j \le K$, be an imprecise source of information, and its imprecise admissible mass $m_j^I = [m_{j_1}^I, m_{j_2}^I, \ldots, m_{j_d}^I]$. We underline that all $m_{j_r}^I$, for $1 \le r \le d$, are sets (not scalars); if there is a scalar $a$, we treat it as a set $[a, a]$. Because $m_j^I$ is admissible, there exist the points (scalars in $[0, 1]$) $m_{j_1}^s \in m_{j_1}^I$, $m_{j_2}^s \in m_{j_2}^I$, ..., $m_{j_d}^s \in m_{j_d}^I$ such that $\sum_{r=1}^{d} m_{j_r}^s = 1$. This property occurs for all sources of information, thus there exist such points $m_{j_r}^s$ for any $1 \le j \le K$ and any $1 \le r \le d$. Now, if we fuse, as a particular case, the masses of only these points, using DSm classic or hybrid rules, and according to the lemmas, based on the DSm network architecture, one gets complete information (i.e. sum of masses equals to 1). See also Lemma 2.


6.5.3 An example with multiple-interval masses 


We present here a more general example with multiple-interval masses. For simplicity, this example is a particular case when the theorem of admissibility is verified by a few points, which happen to be just on the boundaries. More general and complex examples (not reported here due to space limitations) can be given and verified as well. It is however an extreme example, because we tried to comprise all kinds of possibilities which may occur in the imprecise or very imprecise fusion. So, let's consider a fusion problem over $\Theta = \{\theta_1, \theta_2\}$, two independent sources of information with the following imprecise admissible belief assignments

$A$ | $m_1^I(A)$ | $m_2^I(A)$
$\theta_1$ | $[0.1, 0.2] \cup \{0.3\}$ | $[0.4, 0.5]$
$\theta_2$ | $(0.4, 0.6) \cup [0.7, 0.8]$ | $[0, 0.4] \cup \{0.5, 0.6\}$

Table 6.9: Inputs of the fusion with imprecise bba


Using the DSm classic rule for sets, one gets

$$\begin{aligned} m^I(\theta_1) &= ([0.1, 0.2] \cup \{0.3\}) \boxdot [0.4, 0.5] \\ &= ([0.1, 0.2] \boxdot [0.4, 0.5]) \cup (\{0.3\} \boxdot [0.4, 0.5]) \\ &= [0.04, 0.10] \cup [0.12, 0.15] \end{aligned}$$

$$\begin{aligned} m^I(\theta_2) &= ((0.4, 0.6) \cup [0.7, 0.8]) \boxdot ([0, 0.4] \cup \{0.5, 0.6\}) \\ &= ((0.4, 0.6) \boxdot [0, 0.4]) \cup ((0.4, 0.6) \boxdot \{0.5, 0.6\}) \cup ([0.7, 0.8] \boxdot [0, 0.4]) \cup ([0.7, 0.8] \boxdot \{0.5, 0.6\}) \\ &= [0, 0.24) \cup (0.20, 0.30) \cup (0.24, 0.36) \cup [0, 0.32] \cup [0.35, 0.40] \cup [0.42, 0.48] \\ &= [0, 0.40] \cup [0.42, 0.48] \end{aligned}$$

$$\begin{aligned} m^I(\theta_1 \cap \theta_2) &= [([0.1, 0.2] \cup \{0.3\}) \boxdot ([0, 0.4] \cup \{0.5, 0.6\})] \boxplus [[0.4, 0.5] \boxdot ((0.4, 0.6) \cup [0.7, 0.8])] \\ &= [([0.1, 0.2] \boxdot [0, 0.4]) \cup ([0.1, 0.2] \boxdot \{0.5, 0.6\}) \cup (\{0.3\} \boxdot [0, 0.4]) \cup (\{0.3\} \boxdot \{0.5, 0.6\})] \\ &\quad \boxplus [([0.4, 0.5] \boxdot (0.4, 0.6)) \cup ([0.4, 0.5] \boxdot [0.7, 0.8])] \\ &= [[0, 0.08] \cup [0.05, 0.10] \cup [0.06, 0.12] \cup [0, 0.12] \cup \{0.15, 0.18\}] \boxplus [(0.16, 0.30) \cup [0.28, 0.40]] \\ &= [[0, 0.12] \cup \{0.15, 0.18\}] \boxplus (0.16, 0.40] \\ &= (0.16, 0.52] \cup (0.31, 0.55] \cup (0.34, 0.58] \\ &= (0.16, 0.58] \end{aligned}$$

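Hence finally the fusion admissible result is given by:

$A$ | $m^I(A) = [m_1^I \oplus m_2^I](A)$
$\theta_1$ | $[0.04, 0.10] \cup [0.12, 0.15]$
$\theta_2$ | $[0, 0.40] \cup [0.42, 0.48]$
$\theta_1 \cap \theta_2$ | $(0.16, 0.58]$
$\theta_1 \cup \theta_2$ | 0

Table 6.10: Fusion result with the DSm classic rule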
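The products of multiple-interval masses computed above can be reproduced with a small Python sketch. It is a simplification and not the exact set arithmetic of the chapter: every component is treated as a closed interval (so the open ends of $(0.4, 0.6)$ are not tracked), a point $\{k\}$ is written as the degenerate pair `(k, k)`, and overlapping products are merged into a single interval. The function name `multiply_sets` and the variable names are hypothetical.

```python
def multiply_sets(a, b):
    """Multiply two unions of closed, non-negative intervals given as lists
    of (lo, hi) pairs, then merge overlapping results."""
    products = sorted((lo1 * lo2, hi1 * hi2)
                      for lo1, hi1 in a
                      for lo2, hi2 in b)
    merged = []
    for lo, hi in products:
        if merged and lo <= merged[-1][1]:
            # Overlaps (or touches) the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], hi))
        else:
            merged.append((lo, hi))
    return merged


# Inputs of table 6.9, with (0.4, 0.6) approximated by its closure.
m1_theta1 = [(0.1, 0.2), (0.3, 0.3)]
m2_theta1 = [(0.4, 0.5)]
m1_theta2 = [(0.4, 0.6), (0.7, 0.8)]
m2_theta2 = [(0.0, 0.4), (0.5, 0.5), (0.6, 0.6)]

fused_theta1 = multiply_sets(m1_theta1, m2_theta1)  # about [0.04, 0.10] U [0.12, 0.15]
fused_theta2 = multiply_sets(m1_theta2, m2_theta2)  # about [0, 0.40] U [0.42, 0.48]
```

Tracking open and half-open ends would require carrying two extra flags per interval; the bounds themselves are unaffected by that refinement.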
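If one finds out that $\theta_1 \cap \theta_2 \overset{\mathcal{M}}{\equiv} \emptyset$ (this is our hybrid model $\mathcal{M}$ one wants to deal with), then one uses the hybrid DSm rule for sets (6.11): $m^I_{\mathcal{M}}(\theta_1 \cap \theta_2) = 0$ and $m^I_{\mathcal{M}}(\theta_1 \cup \theta_2) = (0.16, 0.58]$, the other imprecise masses are not changed. In other words, one gets now with the hybrid DSm rule applied to imprecise beliefs:

$A$ | $m^I_{\mathcal{M}}(A) = [m_1^I \oplus m_2^I](A)$
$\theta_1$ | $[0.04, 0.10] \cup [0.12, 0.15]$
$\theta_2$ | $[0, 0.40] \cup [0.42, 0.48]$
$\theta_1 \cap \theta_2 \overset{\mathcal{M}}{\equiv} \emptyset$ | 0
$\theta_1 \cup \theta_2$ | $(0.16, 0.58]$

Table 6.11: Fusion result with the hybrid DSm rule for $\mathcal{M}$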


Let's check now the admissibility conditions and theorem. For the source 1, there exist the precise masses $(m_1(\theta_1) = 0.3) \in ([0.1, 0.2] \cup \{0.3\})$ and $(m_1(\theta_2) = 0.7) \in ((0.4, 0.6) \cup [0.7, 0.8])$ such that $0.3 + 0.7 = 1$. For the source 2, there exist the precise masses $(m_2(\theta_1) = 0.4) \in ([0.4, 0.5])$ and $(m_2(\theta_2) = 0.6) \in ([0, 0.4] \cup \{0.5, 0.6\})$ such that $0.4 + 0.6 = 1$. Therefore both sources associated with $m_1^I(.)$ and $m_2^I(.)$ are admissible imprecise sources of information.

It can be easily checked that the DSm classic fusion of $m_1(.)$ and $m_2(.)$ yields the paradoxical basic belief assignment $m(\theta_1) = [m_1 \oplus m_2](\theta_1) = 0.12$, $m(\theta_2) = [m_1 \oplus m_2](\theta_2) = 0.42$ and $m(\theta_1 \cap \theta_2) = [m_1 \oplus m_2](\theta_1 \cap \theta_2) = 0.46$. One sees that the admissibility theorem is satisfied since $(m(\theta_1) = 0.12) \in (m^I(\theta_1) = [0.04, 0.10] \cup [0.12, 0.15])$, $(m(\theta_2) = 0.42) \in (m^I(\theta_2) = [0, 0.40] \cup [0.42, 0.48])$ and $(m(\theta_1 \cap \theta_2) = 0.46) \in (m^I(\theta_1 \cap \theta_2) = (0.16, 0.58])$ such that $0.12 + 0.42 + 0.46 = 1$. Similarly if one finds out that $\theta_1 \cap \theta_2 = \emptyset$, then one uses the hybrid DSm rule and one gets: $m(\theta_1 \cap \theta_2) = 0$ and $m(\theta_1 \cup \theta_2) = 0.46$; the others remain unchanged. The admissibility theorem still holds.


6.6 Conclusion 


In this chapter, we proposed, from the DSmT framework, a new general approach to combine imprecise, uncertain and possibly paradoxical sources of information to cover a wider class of fusion problems. This work was motivated by the fact that in most practical and real fusion problems, the information is rarely known with infinite precision and the admissible belief assignment masses, for each element of the hyper-power set of the problem, have to be taken/chosen more reasonably as sub-unitary intervals (or as sets of sub-unitary intervals) rather than as pure and simple scalar values. This is a generalization of previous available works proposed in the literature (mainly IBS restricted to the TBM framework). We showed that it is possible to fuse interval-valued masses directly using the DSm rules (classic or hybrid ones) and the operations on sets defined in this work. Several illustrative and didactic examples have been given throughout this chapter to show the application of this new approach. The method developed here can also combine incomplete and paraconsistent imprecise, uncertain and paradoxical sources of information as well. This approach (although focused here only on the derivation of imprecise basic belief assignments) can be extended without difficulty to the derivation of imprecise belief and plausibility functions as well as to imprecise pignistic probabilities according to the generalized pignistic transformation presented in chapter 7. This work allows the DSmT to cover a wider class of fusion problems.
Abstract: This chapter introduces a generalized pignistic transformation (GPT) developed in the DSmT framework as a tool for decision-making at the pignistic level. The GPT allows one to construct quite easily a subjective probability measure from any generalized basic belief assignment provided by any corpus of evidence. We focus our presentation on the 3D case and we provide the full result obtained by the proposed GPT and its validation drawn from the probability theory.

This chapter is based on a paper [3] and is reproduced here with permission of the International Society of Information Fusion.
1. Milan Daniel thanks the COST action 274 TARSKI for supporting this work.
7.1 A short introduction to the DSm cardinality

One important notion involved in the definition of the Generalized Pignistic Transformation (GPT) is the DSm cardinality introduced in chapter 3 (section 3.2.2) and in [1]. The DSm cardinality of any element $A$ of hyper-power set $D^\Theta$, denoted $C_\mathcal{M}(A)$, corresponds to the number of parts of $A$ in the corresponding fuzzy/vague Venn diagram of the problem (model $\mathcal{M}$) taking into account the set of integrity constraints (if any), i.e. all the possible intersections due to the nature of the elements $\theta_i$. This intrinsic cardinality depends on the model $\mathcal{M}$ (free, hybrid or Shafer's model). $\mathcal{M}$ is the model that contains $A$, which depends both on the dimension $n = |\Theta|$ and on the number of non-empty intersections present in its associated Venn diagram.

The DSm cardinality depends on the cardinal of $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$ and on the model of $D^\Theta$ (i.e., the number of intersections and between what elements of $\Theta$ - in a word the structure) at the same time. It is not necessary that every singleton, say $\theta_i$, has the same DSm cardinal, because each singleton has a different structure. If its structure is the simplest (no intersection of this element with other elements) then $C_\mathcal{M}(\theta_i) = 1$; if the structure is more complicated (many intersections) then $C_\mathcal{M}(\theta_i) > 1$. Let's consider a singleton $\theta_i$: if it has 1 intersection only then $C_\mathcal{M}(\theta_i) = 2$, for 2 intersections only $C_\mathcal{M}(\theta_i)$ is 3 or 4 depending on the model $\mathcal{M}$, for $m$ intersections it is between $m + 1$ and $2^m$ depending on the model. The maximum DSm cardinality is $2^n - 1$ and occurs for $\theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$ in the free model $\mathcal{M}^f$. Similarly for any set from $D^\Theta$: the more complicated structure it has, the bigger is the DSm cardinal. Thus the DSm cardinality measures the complexity of an element from $D^\Theta$, which is a nice characterization in our opinion; we may say that for the singleton $\theta_i$ not even $|\Theta|$ counts, but only its structure (= how many other singletons intersect $\theta_i$). Simple illustrative examples have already been presented in chapter 3. One has $1 \le C_\mathcal{M}(A) \le 2^n - 1$. $C_\mathcal{M}(A)$ must not be confused with the classical cardinality $|A|$ of a given set $A$ (i.e. the number of its distinct elements) - that's why a new notation is necessary here.


It has been shown in [1] that $C_\mathcal{M}(A)$ is exactly equal to the sum of the elements of the row of $D_n$ corresponding to proposition $A$ in the $u_n$ basis (see chapter). Actually $C_\mathcal{M}(A)$ is very easy to compute by programming from the algorithm of generation of $D^\Theta$ given in chapter 2 and in [2].


If one imposes a constraint that a set $B$ from $D^\Theta$ is empty (i.e. we choose a hybrid DSm model), then one suppresses the columns corresponding to the parts which compose $B$ in the matrix $D_n$ and the row of $B$ and the rows of all elements of $D^\Theta$ which are subsets of $B$, getting a new matrix $D'_n$ which represents a new hybrid DSm model $\mathcal{M}'$. In the $u_n$ basis, one similarly suppresses the parts that form $B$, and now this basis has the dimension $2^n - 1 - C_\mathcal{M}(B)$.
7.2 The Classical Pignistic Transformation (CPT)


We follow here Smets’ point of view [8] about the assumption that beliefs manifest themselves at two 
mental levels: the credal level where beliefs are entertained and the pignistic level where belief functions 
are used to make decisions. Pignistic terminology has been coined by Philippe Smets and comes from 
pignus, a bet in Latin. The probability functions, usually used to quantify the beliefs at both levels, 
are actually used here only to quantify the uncertainty when a decision is really necessary, otherwise we 
argue as Philippe Smets does, that beliefs are represented by belief functions. To take a rational decision, 
we propose to transform generalized beliefs into pignistic probability functions through the Generalized 
Pignistic Transformation (the GPT) which will be presented in the following. We first recall the Classical 
Pignistic Transformation (the CPT) based on Dempster-Shafer Theory (DST) and then we generalize it 


within the Dezert-Smarandache Theory (DSmT) framework. 


When a decision must be taken, we use the expected utility theory which requires to construct a probability function $P\{.\}$ from basic belief function $m(.)$ [8]. This is achieved by the so-called classical Pignistic Transformation. In the Transferable Belief Model (the TBM) context [7] with open-world assumption, Philippe Smets derives the pignistic probabilities from any non normalized basic belief assignment $m(.)$ (i.e. for which $m(\emptyset) \ge 0$) by the following formula [8]:

$$P\{A\} = \sum_{X \subseteq \Theta} \frac{|X \cap A|}{|X|} \frac{m(X)}{1 - m(\emptyset)} \tag{7.1}$$

where $|A|$ denotes the number of worlds in the set $A$ (with convention $|\emptyset|/|\emptyset| = 1$, to define $P\{\emptyset\}$). $P\{A\}$ corresponds to $BetP(A)$ in Smets' notation [8]. Decisions are achieved by computing the expected utilities of the acts using the subjective/pignistic $P\{.\}$ as the probability function needed to compute expectations. Usually, one uses the maximum of the pignistic probability as decision criterion. The maximum of $P\{.\}$ is often considered as a prudent betting decision criterion between the two other alternatives (maximum of plausibility or maximum of credibility, which appear to be respectively too optimistic or too pessimistic). It is easy to show that $P\{.\}$ is indeed a probability function (see [7]).


It is important to note that if the belief mass $m(.)$ results from the combination of two independent sources of evidence (i.e. $m(.) = [m_1 \oplus m_2](.)$) then, at the pignistic level, the classical pignistic probability measure $P\{.\}$ remains the same when using Dempster's rule or when using Smets' rule in his TBM open-world approach working with $m(\emptyset) > 0$. Thus the problem arising with the combination of highly
conflicting sources when using Dempster’s rule (see chapter B), and apparently circumvented with the 
TBM at the credal level, still fundamentally remains at the pignistic level. The problem is only transferred from credal level to pignistic level when using TBM. TBM does not help to improve the reliability of the decision-making with respect to Dempster's rule of combination because the pignistic probabilities are strictly and mathematically equivalent. In other words, if the result of the combination is wrong or at least very questionable or counter-intuitive when the degree of the conflict $m(\emptyset)$ becomes high, then the decision based on pignistic probabilities will become inevitably wrong or very questionable too.


Taking into account the previous remark, we rather prefer to adopt from now on the classical Shafer's definition for basic belief assignment $m(.) : 2^\Theta \to [0, 1]$ which imposes to take $m(\emptyset) = 0$ and $\sum_{X \in 2^\Theta} m(X) = 1$. We adopt therefore the following definition for the Classical Pignistic Transformation (CPT):

$$P\{A\} \triangleq \sum_{X \in 2^\Theta} \frac{|X \cap A|}{|X|}\, m(X) \tag{7.2}$$
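As an illustration of definition (7.2), the CPT can be sketched in Python by representing each proposition of $2^\Theta$ as a frozenset of elements of $\Theta$. The function name `cpt`, the labels `"t1"`, `"t2"` and the mass values below are hypothetical and chosen only for the example; the bba is assumed normalized with $m(\emptyset) = 0$.

```python
def cpt(m, a):
    """Classical pignistic probability P{A} = sum_X |X ∩ A| / |X| * m(X).

    m maps non-empty frozensets of Theta to masses summing to 1
    (Shafer's definition, with m(empty set) = 0).
    """
    return sum(len(x & a) / len(x) * mass for x, mass in m.items())


theta1, theta2 = frozenset({"t1"}), frozenset({"t2"})
bba = {theta1: 0.5, theta2: 0.2, theta1 | theta2: 0.3}

p1 = cpt(bba, theta1)            # 0.5 + 0.3 / 2
p2 = cpt(bba, theta2)            # 0.2 + 0.3 / 2
p_all = cpt(bba, theta1 | theta2)
```

The mass committed to $\theta_1 \cup \theta_2$ is split equally between the two singletons, so `p1` and `p2` add up to `p_all`, which equals 1 up to rounding.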


7.3 A Generalized Pignistic Transformation (GPT) 


7.3.1 Definition 


To take a rational decision within the DSmT framework, it is necessary to generalize the Classical Pignistic Transformation in order to construct a pignistic probability function from any generalized basic belief assignment m(.) drawn from the DSm rules of combination (the classic or the hybrid ones; see chapter Ø). We propose here the simplest and most direct extension of the CPT to define a Generalized Pignistic Transformation as follows:

$$\forall A \in D^\Theta, \quad P\{A\} = \sum_{X \in D^\Theta} \frac{C_{\mathcal{M}}(X \cap A)}{C_{\mathcal{M}}(X)} m(X) \qquad (7.3)$$

where $C_{\mathcal{M}}(X)$ denotes the DSm cardinal of proposition X for the DSm model $\mathcal{M}$ of the problem under consideration.


The decision about the solution of the problem is usually taken by maximizing the pignistic probability function P{.}. Note the close resemblance between the two pignistic transformations (7.2) and (7.3). It can be shown that (7.3) reduces to (7.2) when the hyper-power set $D^\Theta$ reduces to the classical power set $2^\Theta$, i.e. if we adopt Shafer's model. But (7.3) is a generalization of (7.2) since it can be used for computing pignistic probabilities for any model (including Shafer's model).


7.3.2 P{.} is a probability measure


It is important to prove that P{.} built from the GPT is indeed a (subjective/pignistic) probability measure satisfying the following axioms of probability theory [4] [5]:

• Axiom 1 (nonnegativity): The (generalized pignistic) probability of any event A is bounded by 0 and 1:

$$0 \leq P\{A\} \leq 1$$

• Axiom 2 (unity): Any sure event (the sample space) has unity (generalized pignistic) probability:

$$P\{S\} = 1$$

• Axiom 3 (additivity over mutually exclusive events): If A, B are disjoint (i.e. $A \cap B = \emptyset$) then

$$P\{A \cup B\} = P\{A\} + P\{B\}$$


Axiom 1 is satisfied because, by the definition of the generalized basic belief assignment m(.), one has $\forall \alpha_i \in D^\Theta$, $0 \leq m(\alpha_i) \leq 1$ with $\sum_{\alpha_i \in D^\Theta} m(\alpha_i) = 1$, and since all coefficients involved in the GPT are bounded by 0 and 1, it follows directly that the pignistic probabilities are also bounded by 0 and 1.

Axiom 2 is satisfied because all the coefficients involved in the sure event $S \triangleq \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$ are equal to one, since $C_{\mathcal{M}}(X \cap S)/C_{\mathcal{M}}(X) = C_{\mathcal{M}}(X)/C_{\mathcal{M}}(X) = 1$, so that $P\{S\} = \sum_{\alpha_i \in D^\Theta} m(\alpha_i) = 1$.


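Axiom 3 is satisfied. Indeed, from the definition of the GPT, one has

$$P\{A \cup B\} = \sum_{X \in D^\Theta} \frac{C_{\mathcal{M}}(X \cap (A \cup B))}{C_{\mathcal{M}}(X)} m(X) \qquad (7.4)$$

But if we consider A and B exclusive (i.e. $A \cap B = \emptyset$), then it follows:

$$C_{\mathcal{M}}(X \cap (A \cup B)) = C_{\mathcal{M}}((X \cap A) \cup (X \cap B)) = C_{\mathcal{M}}(X \cap A) + C_{\mathcal{M}}(X \cap B)$$

By substituting $C_{\mathcal{M}}(X \cap A) + C_{\mathcal{M}}(X \cap B)$ for $C_{\mathcal{M}}(X \cap (A \cup B))$ in (7.4), one gets:

$$P\{A \cup B\} = \sum_{X \in D^\Theta} \frac{C_{\mathcal{M}}(X \cap A) + C_{\mathcal{M}}(X \cap B)}{C_{\mathcal{M}}(X)} m(X)$$
$$= \sum_{X \in D^\Theta} \frac{C_{\mathcal{M}}(X \cap A)}{C_{\mathcal{M}}(X)} m(X) + \sum_{X \in D^\Theta} \frac{C_{\mathcal{M}}(X \cap B)}{C_{\mathcal{M}}(X)} m(X)$$
$$= P\{A\} + P\{B\}$$

which completes the proof. From the coefficients $C_{\mathcal{M}}(X \cap A)/C_{\mathcal{M}}(X)$ involved in (7.3), it can also be easily checked that $A \subset B \Rightarrow P\{A\} \leq P\{B\}$. One can also easily prove Poincaré's equality $P\{A \cup B\} = P\{A\} + P\{B\} - P\{A \cap B\}$, because $C_{\mathcal{M}}(X \cap (A \cup B)) = C_{\mathcal{M}}((X \cap A) \cup (X \cap B)) = C_{\mathcal{M}}(X \cap A) + C_{\mathcal{M}}(X \cap B) - C_{\mathcal{M}}(X \cap (A \cap B))$ (one has subtracted $C_{\mathcal{M}}(X \cap (A \cap B))$, i.e. the number of parts of $X \cap (A \cap B)$ in the Venn diagram, because these parts were added twice: once in $C_{\mathcal{M}}(X \cap A)$ and a second time in $C_{\mathcal{M}}(X \cap B)$).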
7.4 Some examples for the GPT


7.4.1 Example for the 2D case 


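• With the free DSm model:

Let's consider $\Theta = \{\theta_1, \theta_2\}$ and the generalized basic belief function m(.) over the hyper-power set $D^\Theta = \{\emptyset, \theta_1 \cap \theta_2, \theta_1, \theta_2, \theta_1 \cup \theta_2\}$. It is easy to construct the pignistic probability P{.}. According to the definition of the GPT given in (7.3), one gets:

$$P\{\emptyset\} = 0$$
$$P\{\theta_1\} = m(\theta_1) + \frac{1}{2} m(\theta_2) + m(\theta_1 \cap \theta_2) + \frac{2}{3} m(\theta_1 \cup \theta_2)$$
$$P\{\theta_2\} = m(\theta_2) + \frac{1}{2} m(\theta_1) + m(\theta_1 \cap \theta_2) + \frac{2}{3} m(\theta_1 \cup \theta_2)$$
$$P\{\theta_1 \cap \theta_2\} = \frac{1}{2} m(\theta_2) + \frac{1}{2} m(\theta_1) + m(\theta_1 \cap \theta_2) + \frac{1}{3} m(\theta_1 \cup \theta_2)$$

It is easy to prove that $0 \leq P\{.\} \leq 1$ and $P\{\theta_1 \cup \theta_2\} = P\{\theta_1\} + P\{\theta_2\} - P\{\theta_1 \cap \theta_2\}$.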


• With Shafer's model:

If one adopts Shafer's model (we assume $\theta_1 \cap \theta_2 \equiv \emptyset$), then after applying the hybrid DSm rule of combination, one gets a basic belief function with non-null masses only on $\theta_1$, $\theta_2$ and $\theta_1 \cup \theta_2$. By applying the GPT, one gets:

$$P\{\emptyset\} = 0$$
$$P\{\theta_1 \cap \theta_2\} = 0$$
$$P\{\theta_1\} = m(\theta_1) + \frac{1}{2} m(\theta_1 \cup \theta_2)$$
$$P\{\theta_2\} = m(\theta_2) + \frac{1}{2} m(\theta_1 \cup \theta_2)$$
$$P\{\theta_1 \cup \theta_2\} = m(\theta_1) + m(\theta_2) + m(\theta_1 \cup \theta_2) = 1$$

which naturally corresponds in this case to the pignistic probability built with the classical pignistic transformation (7.2).
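The two 2D cases above can be checked numerically. The sketch below is only an illustration under stated assumptions: each proposition is encoded as the set of Venn-diagram parts it covers, so that the DSm cardinal is simply the number of parts; the function name `gpt`, the part labels and the mass values are hypothetical and do not come from the text.

```python
from fractions import Fraction


def gpt(m, A):
    """Generalized pignistic probability of A, following formula (7.3).

    Propositions are frozensets of Venn-diagram parts, so the DSm
    cardinal of X is len(X) (an encoding assumed for this sketch).
    """
    total = Fraction(0)
    for X, mass in m.items():
        if X:  # the empty proposition carries no mass and is skipped
            total += Fraction(len(X & A), len(X)) * mass
    return total


# Free DSm model: theta1 and theta2 overlap, giving three Venn parts.
t1 = frozenset({"1", "12"})   # part owned by theta1 alone + the overlap
t2 = frozenset({"2", "12"})   # part owned by theta2 alone + the overlap
m_free = {                    # hypothetical masses summing to 1
    t1: Fraction(3, 10),
    t2: Fraction(2, 10),
    t1 & t2: Fraction(4, 10),
    t1 | t2: Fraction(1, 10),
}

# Shafer's model: the overlap is forced to be empty, two parts remain.
s1 = frozenset({"1"})
s2 = frozenset({"2"})
m_shafer = {s1: Fraction(1, 2), s2: Fraction(3, 10), s1 | s2: Fraction(1, 5)}

p_free = {name: gpt(m_free, A) for name, A in
          [("t1", t1), ("t2", t2), ("t1&t2", t1 & t2), ("t1|t2", t1 | t2)]}
p_shafer = {name: gpt(m_shafer, A) for name, A in
            [("s1", s1), ("s2", s2), ("s1|s2", s1 | s2)]}
```

With this encoding the coefficients 1/2, 2/3 and 1/3 of the free-model expressions appear as ratios of part counts, and under Shafer's model the same function returns the classical pignistic values.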
7.4.2 Example for the 3D case 


• With the free DSm model:

Table 7.1: Coefficients $C_{\mathcal{M}}(X \cap \alpha_6)/C_{\mathcal{M}}(X)$ and $C_{\mathcal{M}}(X \cap \alpha_{10})/C_{\mathcal{M}}(X)$ [table body not recoverable from the source]


Let's consider $\Theta = \{\theta_1, \theta_2, \theta_3\}$, its hyper-power set $D^\Theta = \{\alpha_0, \ldots, \alpha_{18}\}$ (with $\alpha_i$, $i = 0, \ldots, 18$ corresponding to the propositions shown in table 3.1 of chapter 3) and the generalized basic belief assignment m(.) over the hyper-power set $D^\Theta$. The six tables presented in the appendix show the full derivations of all generalized pignistic probabilities $P\{\alpha_i\}$ for $i = 1, \ldots, 18$ ($P\{\emptyset\} = 0$ by definition) according to the GPT formula (7.3).

Note that $P\{\alpha_{18}\} = 1$ because $(\theta_1 \cup \theta_2 \cup \theta_3)$ corresponds to the sure event in our subjective probability space and $\sum_{\alpha_i \in D^\Theta} m(\alpha_i) = 1$ by the definition of any generalized basic belief assignment m(.) defined on $D^\Theta$.


It can be verified on this example (as expected, although it is a rather tedious task) that Poincaré's equality holds:

$$P\{A_1 \cup \ldots \cup A_n\} = \sum_{\substack{I \subset \{1, \ldots, n\} \\ I \neq \emptyset}} (-1)^{|I|+1} P\{\bigcap_{i \in I} A_i\} \qquad (7.5)$$


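It is also easy to verify that $\forall A \subset B \Rightarrow P\{A\} \leq P\{B\}$ holds. For example, for $(\alpha_6 \triangleq (\theta_1 \cup \theta_3) \cap \theta_2) \subset (\alpha_{10} \triangleq \theta_2)$, and from the expressions of $P\{\alpha_6\}$ and $P\{\alpha_{10}\}$ given in the appendix, we directly conclude that $P\{\alpha_6\} \leq P\{\alpha_{10}\}$ because

$$\forall X \in D^\Theta, \quad \frac{C_{\mathcal{M}}(X \cap \alpha_6)}{C_{\mathcal{M}}(X)} \leq \frac{C_{\mathcal{M}}(X \cap \alpha_{10})}{C_{\mathcal{M}}(X)} \qquad (7.6)$$

as shown in the table above.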
• Example with a given hybrid DSm model:

Consider now the hybrid DSm model $\mathcal{M} \neq \mathcal{M}^f$ in which we force all possible conjunctions to be empty, except $\theta_1 \cap \theta_2$, according to the second Venn diagram presented in Chapter 3 and shown in Figure 3.2. In this case the hyper-power set $D^\Theta$ reduces to 9 elements $\{\alpha_0, \ldots, \alpha_8\}$ shown in table 3.2 of Chapter 3. The following tables present the full derivations of the pignistic probabilities $P\{\alpha_i\}$ for $i = 1, \ldots, 8$ from the GPT formula (7.3) applied to this hybrid DSm model.


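[Derivation tables for the hybrid DSm model: table bodies not recoverable from the source]

Table 7.3: Derivation of $P\{\alpha_5 \triangleq \theta_1 \cup \theta_2\}$, $P\{\alpha_6 \triangleq \theta_1 \cup \theta_3\}$, $P\{\alpha_7 \triangleq \theta_2 \cup \theta_3\}$ and $P\{\alpha_8 \triangleq \theta_1 \cup \theta_2 \cup \theta_3\}$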


• Example with Shafer's model:

Consider now Shafer's model $\mathcal{M}^0 \neq \mathcal{M}^f$ in which we force all possible conjunctions to be empty according to the third Venn diagram presented in Chapter 3. In this case the hyper-power set $D^\Theta$ reduces to the classical power set $2^\Theta$ with 8 elements $\{\alpha_0, \ldots, \alpha_7\}$ given in table 3.3 of Chapter 3. Applying the GPT formula (7.3), one gets the following pignistic probabilities $P\{\alpha_i\}$ for $i = 1, \ldots, 7$, which naturally coincide, in this particular case, with the values obtained directly by the classical pignistic transformation (7.2):


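Table 7.4: [caption and body not recoverable from the source; the surviving header row lists $P\{\alpha_1\}$, $P\{\alpha_2\}$ and $P\{\alpha_3\}$]

Table 7.5: Derivation of $P\{\alpha_4 \triangleq \theta_1 \cup \theta_2\}$, $P\{\alpha_5 \triangleq \theta_1 \cup \theta_3\}$, $P\{\alpha_6 \triangleq \theta_2 \cup \theta_3\}$ and $P\{\alpha_7 \triangleq \theta_1 \cup \theta_2 \cup \theta_3\} = 1$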


7.5 Conclusion 


A generalization of the classical pignistic transformation, originally developed within the DST framework, has been proposed in this chapter. This generalization is based on the new theory of plausible and paradoxical reasoning (DSmT) and provides a new mathematical tool to help decision-making under uncertainty with paradoxical (i.e. highly conflicting) sources of information. The generalized pignistic transformation (GPT) proposed here makes it possible to build a subjective/pignistic probability measure over the hyper-power set of the frame of the problem under consideration for all kinds of models (free, hybrid or Shafer's model). The GPT coincides naturally with the classical pignistic transformation whenever Shafer's model is adopted. It corresponds to the assumptions of classical pignistic probability generalized to the free DSm model. The relation of the GPT on general hybrid DSm models to the assumptions of the classical PT is still under investigation. Several examples for the 2D and 3D cases, for different kinds of models, have been presented to illustrate the validity of the GPT.
Appendix: Derivation of the

[Appendix tables not recoverable from the source; surviving captions:]
Derivation of $P\{\alpha_1\}$, $P\{\alpha_2\}$ and $P\{\alpha_3\}$
Derivation of $P\{\alpha_{10}\}$, $P\{\alpha_{11}\}$ and $P\{\alpha_{12}\}$
Chapter 8


Probabilized logics related to DSmT and Bayes inference


Frédéric Dambreville
Délégation Générale pour l'Armement, DGA/CTA/DT/GIP/PRO
16 Bis, Avenue Prieur de la Côte d'Or
94114, Arcueil Cedex, France


Abstract: This work proposes a logical interpretation of the non-hybrid Dezert Smarandache Theory (DSmT). As probability is deeply related to a classical semantics, it appears that DSmT relies on an alternative semantics of decision. This semantics is characterized as a probabilized multi-modal logic. It is noteworthy that this interpretation clearly justifies some hypotheses usually made about the fusion rule (i.e. the independence between the sensors). Finally, a conclusion arises: there could be many possible fusion rules, depending on the chosen semantics of decision; and the choice of a semantics depends on how the actual problem is managed. Illustrating this fact, a logical interpretation of the Bayesian inference is proposed as a conclusion to this chapter.


8.1 Introduction 


When a non-deterministic problem appears to be too badly shaped, it becomes difficult to make a coherent use of probabilistic models. A particular difficulty, often neglected, comes from the interpretation of the raw data. The raw data could have a good probabilistic modelling, but in general such information is useless: an interpretation is necessary. Determining the model of interpretation, and its probabilistic law, is the true issue. Due to the forgotten/unknown case syndrome, it is possible that such a model cannot be entirely constructed. In some cases, only a rather weak approximation of the model is possible. Such an approximated model of interpretation may produce paradoxical results. This is particularly true in information fusion problems.


Several new theories have been proposed for managing these difficulties. Dempster Shafer Theory of evidence [1] [5] is one of them. In this paper, we are interested in the Dezert Smarandache Theory (DSmT) [B], a closely related theory. These theories, and particularly the DSmT, are able to manipulate the model contradictions. But a difficulty remains: it is not easy to link these various theories. In particular, their relation with the theory of probability seems unclear. Such a relation is perhaps not possible, as some authors could claim, but it is necessary: it is sometimes needed to combine methods and algorithms based on different theories. This paper intends to establish such relations. A probabilized multi-modal logic is constructed. This probabilized logic, intended for information fusion, induces the same conjunctive fusion operator as DSmT (i.e. operator $\oplus$). Incidentally, the necessity of independent sources for applying the operator $\oplus$ is clarified and confirmed. Moreover, this logical interpretation induces a possible semantics of the DSmT, and somewhat clarifies the intuitions behind this theory. Finally, the paper gives a similar interpretation of the Bayes inference. Although the Bayes inference is not related to the DSmT, this last result suggests that probabilized logics could be a possible common frame for several non-deterministic theories.


Section 8.2 begins with a general discussion about probability. It is shown that probabilistic modellings are sometimes questionable. Following this preliminary discussion, two versions of the theory of evidence are introduced: the historical Dempster Shafer Theory and the Transferable Belief Model of Smets [8]. Section 8.3 makes a concise presentation of the Dezert Smarandache Theory. The short section 8.4 establishes some definitions about probability (and partial probability) over a set of logical propositions. These general definitions are needed in the following sections. Section 8.5 gives a logical interpretation of the DSmT on a small example. This section does not enter the theory too deeply: the modal logic associated with this interpretation is described in practical words, not with formulae. Section 8.6 generalizes the results to any case. This section is much more theoretical. The modal logic is defined mathematically. Section 8.7 proposes a similar logical interpretation of the Bayesian inference. Section 8.8 concludes.
8.2 Belief Theory Models


8.2.1 Preliminary: about probability 


This subsection discusses the difficulty of modelling "everything" with probability. Given a measurable universe of abstract events (or propositions) $\Omega = \{\omega_i, i \in I\}$, a probability P could be defined as a bounded and normalized measure over $\Omega$. In this paper, we are interested in finite models (I is finite).

A probability P could also be defined from the probabilities $p(\omega)$ of the elementary events $\omega \in \Omega$. The density of probability p should verify (finite case):

$$p : \Omega \rightarrow \mathbb{R}^+,$$

and:

$$\sum_{\omega \in \Omega} p(\omega) = 1.$$

The probability P is recovered by means of the additivity property:

$$\forall A \subset \Omega, \quad P(A) = \sum_{\omega \in A} p(\omega).$$

It is important to remember how such abstract definitions are related to a concrete notion of "chance" in the actual universe. Behind the formalism, behind the abstract events, there are actual events. The formalism introduced by the abstract universe $\Omega$ is just a modelling of the actual universe. Such a model is expected to be more suitable for mathematical manipulations and reasoning. But there is no reason why these actual events should be compatible with the abstract events. Probability theory assumes this compatibility. More precisely, probability assumes that either the abstract and actual events are the same, or there is a mapping from the actual events to the abstract events (figure 8.1). When this mapping hypothesis is made, the density function makes sense with regard to the observation. Indeed, a practical construction of p becomes possible in a frequentist manner:

1. Set $p(\omega) = 0$ for all $\omega \in \Omega$,
2. Make N tossings of an actual event. For each tossed event, a, do:
(a) Select the $\omega \in \Omega$ such that a maps to $\omega$,
(b) Set $p(\omega) = p(\omega) + 1$,
3. Set $p(\omega) = \frac{1}{N} p(\omega)$ for all $\omega \in \Omega$.
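The three steps above can be sketched as a short counting routine. This is a minimal illustration, assuming the mapping from actual to abstract events is given as a dictionary; the function name `estimate_density` and the event labels are hypothetical.

```python
from collections import Counter
from fractions import Fraction


def estimate_density(tosses, to_abstract, universe):
    """Frequentist estimate of p over the abstract universe.

    tosses:      list of observed actual events
    to_abstract: dict mapping each actual event to one abstract event
    universe:    iterable of all abstract events
    """
    counts = Counter({w: 0 for w in universe})   # step 1: p(w) = 0
    for a in tosses:                             # step 2: count each toss
        counts[to_abstract[a]] += 1
    n = len(tosses)
    return {w: Fraction(counts[w], n) for w in universe}  # step 3: divide by N


# Hypothetical data: three actual events mapped onto two of three abstract events.
to_abstract = {"a1": "w1", "a2": "w1", "a3": "w2"}
tosses = ["a1", "a3", "a2", "a1"]
p = estimate_density(tosses, to_abstract, ["w1", "w2", "w3"])
```

The estimate is a density in the sense defined earlier: it is non-negative and sums to one, and an abstract event that is never hit keeps the value 0.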


The next paragraph explains why the mapping from the actual events to the abstract events is not always possible and how to overcome this difficulty.




























[Figure: left panel "Actual universe (observations)", right panel "Abstract universe (representation)". An abstract event is a connected component; in this example, the ×-ed observations map to the unique ×-ed component.]

Figure 8.1: Event mapping: probabilist case


8.2.1.1 The impossible quest of the perfect universe 


It is always possible to assume that there is a perfect universe, where all problems could be modeled, but we are not able to construct it or to manipulate it practically. However, we are able to think with it. Let A be the actual universe, let $\Omega$ be the abstract universe, and let Z be this perfect universe.

The structure of $\Omega$ is well known; it describes our modelling of the actual world. This is how we interpret the observations. Practically, such an interpretation is almost always necessary, while the raw observation may be useless. But $\Omega$ is only a hypothesis: our knowledge about the observation is generally insufficient for a true interpretation.

The universe A is observed, but like Z its structure is not really known: although an observation is possible, it is not necessarily possible to know the meaning, the true interpretation, of this observation. For example, what is the meaning of an observation for a situation never seen before?

The universe Z is perfect, which means that it contains the two others, and is unknown. The word "contains" has a logical meaning here, i.e. the events/propositions of A or $\Omega$ are macro-events/macro-propositions of Z (figure 8.2):

$$A \subset P(Z) \quad \text{and} \quad \Omega \subset P(Z),$$

with the following exhaustiveness (x) and coherence (c) hypotheses for A and $\Omega$:

x. $Z = \bigcup_{a \in A} a = \bigcup_{\omega \in \Omega} \omega$,

c1. $[a_1, a_2 \in A,\ a_1 \neq a_2] \Rightarrow a_1 \cap a_2 = \emptyset$,

c2. $[\omega_1, \omega_2 \in \Omega,\ \omega_1 \neq \omega_2] \Rightarrow \omega_1 \cap \omega_2 = \emptyset$.
[Figure: abstract events (i.e. a, b, c, d, e) and actual events (i.e. 1, 2, 3, 4, 5, 6), each drawn as a connected component; the line styles of the legend are not recoverable from the source.]

Figure 8.2: Event mapping: general case


The exhaustiveness and coherence hypotheses are questionable; it will be seen that these hypotheses induce contradictions when fusing information.

Of course, the abstract universe $\Omega$ is a coherent interpretation of the observations when any actual event $a \in A$ is a subevent of an abstract event $\omega \in \Omega$. But since the interpretation of A is necessarily partial and subjective, this property does not hold in general. Figure 8.2 gives an example of an erroneous interpretation of the observations: the actual event 5 intersects both the abstract event d and the abstract event c. More precisely, if an actual event $a \in A$ happens, there is a perfect event $z \in a$ which has happened. Since Z contains (i.e. maps to) $\Omega$, there is a unique abstract event $\omega \in \Omega$ which checks z, i.e. $z \in \omega$. As a conclusion, when a given actual event a happens, any abstract event $\omega \in \Omega$ such that $\omega \cap a \neq \emptyset$ is likely to happen. Practically, such a situation is easy to decide, since it just happens when a doubt appears in a measure classification. Table 8.1, referring to the example of figure 8.2, gives the possible abstract events related to each tossed observation.


Finally, it does not seem possible to define a density of probability for unique abstract events from partially decidable observations. But it is possible to define a density function for multiple events. Again, a construction of such a function, still denoted p, is possible in a frequentist manner:

1. Set $p(\phi) = 0$ for all $\phi \subset \Omega$,
2. Make N tossings of an actual event. For each tossed event, a, do:
(a) Define the set $\phi(a) = \{\omega \in \Omega \,/\, \omega \cap a \neq \emptyset\}$,
(b) Set $p(\phi(a)) = p(\phi(a)) + 1$,
3. Set $p(\phi) = \frac{1}{N} p(\phi)$ for all $\phi \subset \Omega$.
Tossed observation    Possible abstract events
1                     a, c
2                     a
3                     b
4                     b
5                     c, d
6                     b, c, e

Table 8.1: Event multi-mapping for figure 8.2


In particular, $p(\emptyset) = 0$.

In the particular case of table 8.1, this density is related to the probability of observation by:

$$p\{a, c\} = p(1), \quad p\{a\} = p(2), \quad p\{b\} = p(3) + p(4), \quad p\{c, d\} = p(5), \quad p\{b, c, e\} = p(6).$$
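The multi-event construction can be sketched in the same counting style. The mapping below follows the relations just stated for table 8.1 (observation 5 is taken to hit c and d, as in the discussion of figure 8.2); the function name `multi_event_density` and the toss sequence are hypothetical.

```python
from collections import Counter
from fractions import Fraction

# phi(a): set of abstract events intersecting each actual event a (table 8.1).
PHI = {
    1: frozenset("ac"),
    2: frozenset("a"),
    3: frozenset("b"),
    4: frozenset("b"),
    5: frozenset("cd"),
    6: frozenset("bce"),
}


def multi_event_density(tosses, phi):
    """Density over sets of abstract events, built by counting phi(a)."""
    counts = Counter(phi[a] for a in tosses)        # steps 1 and 2
    n = len(tosses)
    return {s: Fraction(c, n) for s, c in counts.items()}  # step 3


tosses = [1, 2, 3, 4, 5, 6, 3, 1]                   # hypothetical observations
p = multi_event_density(tosses, PHI)

# Observation frequencies, used to check p{b} = p(3) + p(4).
freq = {a: Fraction(tosses.count(a), len(tosses)) for a in PHI}
```

Since every actual event intersects at least one abstract event, the empty set never receives mass, which matches $p(\emptyset) = 0$.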




The previous discussion has shown that the definition of a density of probability for the abstract events does not make sense when the interpretations of the observations are approximate. However, it is possible to construct a density for multiple abstract events. Such a density looks quite similar to the Basic Belief Assignment of DST, defined in the next section.


8.2.2 Dempster Shafer Theory 
8.2.2.1 Definition 


A Dempster Shafer model [1] [2] [5] is characterized by a pair $(\Omega, m)$, where $\Omega$ is a set of abstract events and the basic belief assignment (bba) m is a non-negatively valued function defined over $P(\Omega)$, the set of subsets of $\Omega$, such that:

$$m(\emptyset) = 0 \quad \text{and} \quad \sum_{\phi \subset \Omega} m(\phi) = 1.$$

A Dempster Shafer model $(\Omega, m)$ could be seen as a non-deterministic interpretation of the actuality. Typically, it is a tool providing information from a sensor.


8.2.2.2 Belief of a proposition 


Let $\phi \subset \Omega$ be a proposition. Assume a basic belief assignment m. The degree of belief of $\phi$, $Bel(\phi)$, and the plausibility of $\phi$, $Pl(\phi)$, are defined by:

$$Bel(\phi) = \sum_{\psi \subset \phi} m(\psi) \quad \text{and} \quad Pl(\phi) = \sum_{\psi \cap \phi \neq \emptyset} m(\psi).$$

Bel and Pl do not satisfy the additivity property of probability. $Bel(\phi)$ and $Pl(\phi)$ are the lower and upper measures of the "credibility" of the proposition $\phi$. These measures are sometimes considered as the lower bound and the upper bound of the probability of $\phi$:

$$Bel(\phi) \leq P(\phi) \leq Pl(\phi).$$

This interpretation is dangerous, since it is generally admitted that probability and DST are quite different theories.
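As an illustration of these two definitions, the sketch below computes Bel and Pl for a small bba. The universe, the masses and the function names `bel` and `pl` are hypothetical choices made for the example only.

```python
from fractions import Fraction


def bel(m, phi):
    """Degree of belief: total mass of the non-empty subsets of phi."""
    return sum((v for psi, v in m.items() if psi and psi <= phi), Fraction(0))


def pl(m, phi):
    """Plausibility: total mass of the sets that intersect phi."""
    return sum((v for psi, v in m.items() if psi & phi), Fraction(0))


OMEGA = frozenset("abc")
m = {                                   # hypothetical bba, masses sum to 1
    frozenset("a"): Fraction(1, 2),
    frozenset("ab"): Fraction(3, 10),
    OMEGA: Fraction(1, 5),
}

ab, c = frozenset("ab"), frozenset("c")
bel_ab, pl_ab = bel(m, ab), pl(m, ab)
bel_c, pl_c = bel(m, c), pl(m, c)
```

On this example Bel is not additive: the beliefs of the two complementary propositions {a, b} and {c} do not sum to one, while the plausibility of each equals one minus the belief of the other.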
8.2.2.3 Fusion rule 


Assume two bba $m_1$ and $m_2$, defined on the same universe $\Omega$, obtained from two different sources. It is generally assumed that the sources are independent. Then, the bba $m_1 \oplus m_2$ is defined by:

$$m_1 \oplus m_2(\emptyset) = 0,$$

$$m_1 \oplus m_2(\phi) = \frac{1}{Z} \sum_{\psi_1 \cap \psi_2 = \phi} m_1(\psi_1) m_2(\psi_2), \quad \text{where } Z = 1 - \sum_{\psi_1 \cap \psi_2 = \emptyset} m_1(\psi_1) m_2(\psi_2).$$

The operator $\oplus$ describes the (conjunctive) information fusion between two bba.
The normalizer Z is needed since the bba is zeroed for the empty set $\emptyset$. Except in some specific cases, it is indeed possible that:

$$m_1(\psi_1) m_2(\psi_2) > 0, \qquad (8.1)$$

$$\psi_1 \cap \psi_2 = \emptyset. \qquad (8.2)$$

In particular, property (8.2) is related to an implied coherence hypothesis; more precisely, since the universe $\Omega$ is defined as a set of events, the intersection of distinct singletons is empty:

$$\forall \{\omega_1\}, \{\omega_2\} \subset \Omega, \quad \{\omega_1\} \neq \{\omega_2\} \Rightarrow \{\omega_1\} \cap \{\omega_2\} = \emptyset.$$

Notice that this hypothesis is quite similar to hypothesis c2 of section 8.2.1. The coherence hypothesis seems to be the source of the contradictions in the abstract model when fusing information. Finally, Z < 1 means that our abstract universe $\Omega$ has been incorrectly defined and is thus unable to fit both sensors. Z measures the error in our model of interpretation. This ability of the rule $\oplus$ is really new in comparison with probabilistic rules.
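A minimal sketch of this fusion rule, returning both the fused bba and the normalizer Z, may help to see how Z measures the conflict. The function name `dempster` and the two bba are hypothetical; exact fractions are used so that the result can be compared without rounding.

```python
from fractions import Fraction
from itertools import product


def dempster(m1, m2):
    """Conjunctive fusion with normalization; returns (fused bba, Z)."""
    raw = {}
    for (psi1, x), (psi2, y) in product(m1.items(), m2.items()):
        key = psi1 & psi2                       # conjunction = set intersection
        raw[key] = raw.get(key, Fraction(0)) + x * y
    conflict = raw.pop(frozenset(), Fraction(0))  # mass sent to the empty set
    z = 1 - conflict
    if z == 0:
        raise ValueError("totally conflicting sources: Z = 0")
    return {phi: v / z for phi, v in raw.items()}, z


BLACK, WHITE = frozenset({"black"}), frozenset({"white"})
OMEGA = BLACK | WHITE
m1 = {BLACK: Fraction(9, 10), OMEGA: Fraction(1, 10)}   # hypothetical sensor 1
m2 = {WHITE: Fraction(9, 10), OMEGA: Fraction(1, 10)}   # hypothetical sensor 2

fused, z = dempster(m1, m2)
```

Here the two sources mostly disagree, so Z is far below 1, and the normalization redistributes the remaining mass over the non-empty propositions.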


8.2.3 Transferable Belief Model 


Smets has given an extensive presentation of the TBM [8]. This section focuses on a minimal and somewhat simplified description of the model.


8.2.3.1 Definition 


A Transferable Belief Model is characterized by a pair $(\Omega, m)$, where $\Omega$ is a set of abstract events and the basic belief assignment m is a non-negatively valued function defined over $P(\Omega)$ such that:

$$\sum_{\phi \subset \Omega} m(\phi) = 1.$$

In this definition, the hypothesis $m(\emptyset) = 0$ does not hold anymore.


8.2.3.2 Fusion rule


Smets' rule looks like a refinement of Dempster and Shafer's rule:

$$m_1 \oplus m_2(\phi) = \sum_{\psi_1 \cap \psi_2 = \phi} m_1(\psi_1) m_2(\psi_2).$$

Notice that the normalizer does not exist anymore. The measure of contradiction has been moved into $m(\emptyset)$. This theory has been justified from an axiomatization of the fusion rule.
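8.2.3.3 TBM generalizes DST


First, notice that any bba for DST is a valid bba for TBM, but the converse is false because of $\emptyset$. Now, for any bba $m_T$ of TBM such that $m_T(\emptyset) < 1$, construct the bba $\Delta(m_T)$ of DST defined by:

$$\Delta(m_T)(\emptyset) = 0 \quad \text{and} \quad \forall \phi \subset \Omega : \phi \neq \emptyset, \quad \Delta(m_T)(\phi) = \frac{m_T(\phi)}{1 - m_T(\emptyset)}.$$

$\Delta$ is an onto mapping. Any bba $m_D$ of DST is a bba of TBM, and $\Delta(m_D) = m_D$.

$\Delta$ is a morphism for $\oplus$, i.e. $\Delta(m_{T,1} \oplus m_{T,2}) = \Delta(m_{T,1}) \oplus \Delta(m_{T,2})$.

Proof. By definition, it is clear that:

$$\Delta(m_{T,1}) \oplus \Delta(m_{T,2})(\emptyset) = 0 = \Delta(m_{T,1} \oplus m_{T,2})(\emptyset).$$

Now, for any $\phi \subset \Omega$ such that $\phi \neq \emptyset$:

$$\Delta(m_{T,1}) \oplus \Delta(m_{T,2})(\phi) = \frac{\sum_{\psi_1 \cap \psi_2 = \phi} \Delta(m_{T,1})(\psi_1) \Delta(m_{T,2})(\psi_2)}{\sum_{\phi \neq \emptyset} \sum_{\psi_1 \cap \psi_2 = \phi} \Delta(m_{T,1})(\psi_1) \Delta(m_{T,2})(\psi_2)}$$

$$= \frac{\sum_{\psi_1 \cap \psi_2 = \phi} \frac{m_{T,1}(\psi_1)}{1 - m_{T,1}(\emptyset)} \cdot \frac{m_{T,2}(\psi_2)}{1 - m_{T,2}(\emptyset)}}{\sum_{\phi \neq \emptyset} \sum_{\psi_1 \cap \psi_2 = \phi} \frac{m_{T,1}(\psi_1)}{1 - m_{T,1}(\emptyset)} \cdot \frac{m_{T,2}(\psi_2)}{1 - m_{T,2}(\emptyset)}} = \frac{\sum_{\psi_1 \cap \psi_2 = \phi} m_{T,1}(\psi_1) m_{T,2}(\psi_2)}{\sum_{\phi \neq \emptyset} \sum_{\psi_1 \cap \psi_2 = \phi} m_{T,1}(\psi_1) m_{T,2}(\psi_2)}$$

$$= \frac{m_{T,1} \oplus m_{T,2}(\phi)}{\sum_{\phi \neq \emptyset} m_{T,1} \oplus m_{T,2}(\phi)} = \frac{m_{T,1} \oplus m_{T,2}(\phi)}{1 - m_{T,1} \oplus m_{T,2}(\emptyset)} = \Delta(m_{T,1} \oplus m_{T,2})(\phi).$$

Since $\Delta$ is an onto morphism, TBM is a generalization of DST. More precisely, a bba of TBM contains more information than a bba of DST, i.e. the measure of contradiction $m(\emptyset)$, but this complementary information remains compatible with the fusion rule of DST.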
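The morphism property can also be checked numerically on a small case. The sketch below is an illustration only: the function names (`conjunctive`, `normalize`, `dempster`) and the two bba are hypothetical, and `normalize` plays the role of the mapping from TBM bba to DST bba described above.

```python
from fractions import Fraction
from itertools import product

EMPTY = frozenset()


def conjunctive(m1, m2):
    """Smets' unnormalized rule: conflict mass stays on the empty set."""
    out = {}
    for (psi1, x), (psi2, y) in product(m1.items(), m2.items()):
        key = psi1 & psi2
        out[key] = out.get(key, Fraction(0)) + x * y
    return out


def normalize(m):
    """Map a TBM bba to a DST bba by dividing by 1 - m(empty set)."""
    k = 1 - m.get(EMPTY, Fraction(0))
    return {phi: v / k for phi, v in m.items() if phi}


def dempster(m1, m2):
    """Dempster's rule written with its own normalizer Z."""
    raw = conjunctive(m1, m2)
    z = 1 - raw.pop(EMPTY, Fraction(0))
    return {phi: v / z for phi, v in raw.items()}


A, B = frozenset("a"), frozenset("b")
AB = A | B
m1 = {EMPTY: Fraction(1, 10), A: Fraction(5, 10), AB: Fraction(4, 10)}
m2 = {EMPTY: Fraction(2, 10), B: Fraction(3, 10), AB: Fraction(5, 10)}

lhs = normalize(conjunctive(m1, m2))          # fuse in TBM, then normalize
rhs = dempster(normalize(m1), normalize(m2))  # normalize, then fuse in DST
```

Both orders of operation give the same bba on this example, as the proof predicts; a single example is of course a sanity check and not a substitute for the proof.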

The Dezert Smarandache Theory is introduced in the next section. This theory shares many common points with TBM. But it makes one main and fundamental contribution: it does not make the coherence hypothesis anymore, and the contradictions are managed differently. The abstract model is more flexible with respect to the interpretation, and there is no need to rebuild the model in case of contradicting sensors.


8.3 Dezert Smarandache Theory (DSmT) 


Both in DST and in TBM, the difficulty in the model definition appears when dealing with the contradictions between the pieces of information. But contradictions are unavoidable when dealing with imprecise information. This assertion is illustrated by the following example.
B&W example. Assume a sensor $s_1$ which tells us if an object is white (W) or not (NW), and gives no answer ($NA_1$) in litigious cases. The actual universe for this sensor is $A_1 = \{W, NW, NA_1\}$. Assume a sensor $s_2$ which tells us if an object is black (B) or not (NB), and gives no answer ($NA_2$) in litigious cases. The actual universe for this sensor is $A_2 = \{B, NB, NA_2\}$. These characteristics are not known, but the sensors have been tested with black or white objects. For this reason, it is natural to model our world by $\Omega = \{black, white\}$. When a litigious case happens, its interpretation will just be the pair $\{black, white\}$. Otherwise the good answer is expected. The following properties are then verified:

$$B, NW \subset black \quad \text{and} \quad W, NB \subset white.$$

The coherence hypothesis is assumed, that is $black \cap white = \emptyset$. The event $black \cap white$ is impossible. This model works well as long as the sensors work separately or the objects are still black or white. Now, in a true universe there are many objects which are neither white nor black, and this without any litigation. For example: gray objects. Assume that the two sensors are activated. Then, the fused sensors will answer $NW \cap NB$, which will be interpreted as $black \cap white$. This contradicts the coherence hypothesis.


Conclusion. This example is a sketch of what generally happens when constructing a system of decision. Several sources of information are available (two sensors here). These sources have different discrimination abilities. In fact, these discrimination abilities are not really known, but by running these sources on several test samples (black and white objects here), a model of these abilities is obtained (here it is learned within $\Omega$ that our sensors distinguish between black and white objects). Of course, it is never sure that this model is complete. It is still possible that some new unknown cases could be discriminated by the information sources. In the example, the combination of the two sensors made it possible to discriminate a new class of objects: the objects that are neither black nor white. But when fusing these sensors, the new cases will become contradictions regarding the coherence hypothesis. Not only does the coherence hypothesis make our model contradictory, but it also prevents us from discovering new cases. The coherence hypothesis should be removed! Dezert and Smarandache proposed a model without the coherence hypothesis.


8.3.1 Dezert Smarandache model 


In DST and TBM, the coherence hypothesis was implied by the use of a set, $\Omega$, to represent the abstract universe. Moreover, the set operators $\cap$, $\cup$ and $^c$ (i.e. set complement) were used to explain the interactions between the propositions $\phi \subset \Omega$. In fact, the notion of proposition is related to the notion of Boolean Algebra. Sets together with set operators are particular models of Boolean Algebra. Since DSmT does not make the coherence hypothesis, DSmT cannot rely on the set formalism. However, some boolean relations are needed to explain the relations between propositions. Another fundamental Boolean Algebra is the propositional logic. This model should be used for the representation of the propositions of DSmT. Nevertheless, the negation operator will be removed from our logic, since it itself implies some coherence hypotheses, e.g. $\phi \wedge \neg\phi = \bot$. By identifying the equivalent propositions of the resulting logic, a hyper-power set of propositions is obtained. Hyper-power sets are used as models of universe for the DSmT.


8.3.1.1 Hyper-power set 


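Let $\Phi = \{\phi_i \,/\, i \in I\}$ be a set of propositions. The hyper-power set $\langle \Phi \rangle$ is the free boolean pre-algebra generated by $\Phi$ and the boolean operators $\wedge$ and $\vee$:

$$\Phi, \ \langle \Phi \rangle \wedge \langle \Phi \rangle, \ \langle \Phi \rangle \vee \langle \Phi \rangle \subset \langle \Phi \rangle$$

and $\wedge$, $\vee$ verify the properties:

Commutative. $\phi \wedge \psi = \psi \wedge \phi$ and $\phi \vee \psi = \psi \vee \phi$,

Associative. $\phi \wedge (\psi \wedge \eta) = (\phi \wedge \psi) \wedge \eta$ and $\phi \vee (\psi \vee \eta) = (\phi \vee \psi) \vee \eta$,

Distributive. $\phi \wedge (\psi \vee \eta) = (\phi \wedge \psi) \vee (\phi \wedge \eta)$ and $\phi \vee (\psi \wedge \eta) = (\phi \vee \psi) \wedge (\phi \vee \eta)$,

Idempotent. $\phi \wedge \phi = \phi$ and $\phi \vee \phi = \phi$,

Neutral sup/sub-elements. $\phi \wedge (\phi \vee \psi) = \phi$ and $\phi \vee (\phi \wedge \psi) = \phi$,

for any $\phi, \psi, \eta \in \langle \Phi \rangle$.

Unless more specifications about the free pre-algebra are made, this definition forbids the propositions to be exclusive (no coherence assumption) or to be exhaustive. In particular, the negation operator, $\neg$, and the never happen / always happen, $\bot$/$\top$, are excluded from the formalism. Indeed, the negation is related to the coherence hypothesis, while $\top$ is related to the exhaustiveness hypothesis.

Property. It is easily proved from the definition that:

$$\forall \phi, \psi \in \langle \Phi \rangle, \quad \phi \wedge \psi = \phi \iff \phi \vee \psi = \psi.$$

The order $\leq$ is a meta-operator defined over $\langle \Phi \rangle$ by:

$$\phi \leq \psi \iff \phi \wedge \psi = \phi \iff \phi \vee \psi = \psi.$$

The order < is a meta-operator defined over $\langle \Phi \rangle$ by:

$$\phi < \psi \iff [\phi \leq \psi \text{ and } \phi \neq \psi].$$

The hyper-power set order $\leq$ is the analogue of the set order $\subset$.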
8.3.1.2 Dezert Smarandache Model


A Dezert Smarandache model (DSmm) is a pair $(\Phi, m)$, where the (abstract) universe $\Phi$ is a set of propositions and the basic belief assignment m is a non-negatively valued function defined over $\langle \Phi \rangle$ such that:

$$\sum_{\phi \in \langle \Phi \rangle} m(\phi) = 1.$$


8.3.1.3 Belief Function 


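The belief function Bel is defined by:

$$\forall \phi \in \langle \Phi \rangle, \quad Bel(\phi) = \sum_{\psi \in \langle \Phi \rangle : \psi \leq \phi} m(\psi). \qquad (8.3)$$

Since propositions are never exclusive within $\langle \Phi \rangle$, the (classical) plausibility function is just equal to 1. Equation (8.3) is invertible:

$$\forall \phi \in \langle \Phi \rangle, \quad m(\phi) = Bel(\phi) - \sum_{\psi \in \langle \Phi \rangle : \psi < \phi} m(\psi).$$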


8.3.2 Fusion rule 


For a given universe $\Phi$, and two basic belief assignments $m_1$ and $m_2$ associated with different sensors, the fused basic belief assignment is $m_1 \oplus m_2$, defined by:

$$m_1 \oplus m_2(\phi) = \sum_{\psi_1 \wedge \psi_2 = \phi} m_1(\psi_1) m_2(\psi_2). \qquad (8.4)$$


8.3.2.1 Dezert & Smarandache’s example 


Assume a thief (45 years old) witnessed by a granddad and a grandson. The witnesses answer the question: is the thief young or old? The universe is then $\Phi = \{young, old\}$. The granddad answers that the thief is rather young. His testimony is described by the bba:

$$m_1(young) = 0.99 \quad \text{and} \quad m_1(young \vee old) = 0.01 \ \text{(slight unknown)}.$$

Of course, the grandson thinks he is rather old:

$$m_2(old) = 0.99 \quad \text{and} \quad m_2(young \vee old) = 0.01 \ \text{(slight unknown)}.$$

How to interpret the testimonies? The fusion rule says:

$$m_1 \oplus m_2(young \wedge old) = 0.9801 \ \text{(highly contradicts: third case)}$$

$$m_1 \oplus m_2(young) = m_1 \oplus m_2(old) = 0.0099$$

$$m_1 \oplus m_2(young \vee old) = 0.0001$$

Our hypotheses contradict. There was a third case: the thief is middle aged.
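The fused values 0.9801, 0.0099 and 0.0001 quoted in this example can be reproduced with a few lines of code, taking masses 0.99 and 0.01 for each witness. The sketch below is an illustration under an assumed encoding: each proposition of the hyper-power set is represented by the set of Venn-diagram parts it covers in the free model, so that the conjunction becomes a set intersection that is never empty. The name `dsm_fuse` and the part labels are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Free model over {young, old}: parts "y" (young only), "o" (old only)
# and "yo" (both), an encoding assumed for this sketch.
YOUNG = frozenset({"y", "yo"})
OLD = frozenset({"o", "yo"})
YOUNG_AND_OLD = YOUNG & OLD
YOUNG_OR_OLD = YOUNG | OLD


def dsm_fuse(m1, m2):
    """Conjunctive rule (8.4): no normalization, no empty proposition."""
    out = {}
    for (psi1, x), (psi2, y) in product(m1.items(), m2.items()):
        key = psi1 & psi2
        out[key] = out.get(key, Fraction(0)) + x * y
    return out


m1 = {YOUNG: Fraction(99, 100), YOUNG_OR_OLD: Fraction(1, 100)}  # granddad
m2 = {OLD: Fraction(99, 100), YOUNG_OR_OLD: Fraction(1, 100)}    # grandson

fused = dsm_fuse(m1, m2)
```

Almost all of the fused mass lands on the conjunction of the two hypotheses, which the model keeps as a proposition of its own instead of discarding it as conflict.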
8.3.2.2 Comments


In DSmT, there is no clear distinction between the notion of conjunction, $\wedge$, the notion of third case and the notion of contradiction. The model does not decide between them and leaves this distinction to our final interpretation. It is our interpretation of the model which will make the distinction. Thus, the DSm model avoids any over-abstraction of the actual universe. Consequently, it never fails, although we could fail in the last instance when interpreting it. Another good consequence is that DSmT specifies any contradiction/third case: the contradiction $\phi \wedge \psi$ is not just a contradiction, it is the contradiction between $\phi$ and $\psi$.


8.4 Probability over logical propositions 


Probabilities are classically defined over measurable sets. However, this is only one way to model the notion of probability, which is essentially a measure of the belief of logical propositions. Probability could be defined without reference to measure theory, at least when the number of propositions is finite. In this section, the notion of probability is explained within a strict logical formalism. This formalism is of constant use in the sequel.


Intuitively, a probability over a set of logical propositions is a measure of belief which is additive (disjoint propositions add their chances) and increasing with the proposition (weak propositions are more probable). This measure should be zero for the impossible propositions and full for the ever-true propositions. Moreover, a probability is a multiplicative measure for independent propositions. The independence of propositions is a meta-relation between propositions, which generally depends on the problem setting.


These intuitions are formalized now. It is assumed that the reader is familiar with some logical notions.


8.4.1 Definition 


Let $L$ be at least an extension of the classical logic of propositions, that is $L$ contains the operators $\wedge$, $\vee$, $\neg$ (and, or, negation) and the propositions $\bot$, $\top$ (always false, always true). Assume moreover that some proposition pairs of $L$ are recognized as independent propositions (this is a meta-relation not necessarily related to the logic itself). A probability $p$ over $L$ is an $\mathbb{R}^+$ valued function such that for any propositions $\phi$ and $\psi$ of $L$:


Additivity. $p(\phi \wedge \psi) + p(\phi \vee \psi) = p(\phi) + p(\psi)$,

Coherence. $p(\bot) = 0$,

Finiteness. $p(\top) = 1$,

Multiplicativity. When $\phi$ and $\psi$ are independent propositions, then $p(\phi \wedge \psi) = p(\phi)\, p(\psi)$.


8.4.2 Property 
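The coherence and additivity imply the increaseness of $p$:

Increaseness. $p(\phi \wedge \psi) \le p(\phi)$.

Proof. Since $\phi \equiv (\phi \wedge \psi) \vee (\phi \wedge \neg\psi)$ and $(\phi \wedge \psi) \wedge (\phi \wedge \neg\psi) \equiv \bot$, it follows from the additivity:

$$p(\phi) + p(\bot) = p(\phi \wedge \psi) + p(\phi \wedge \neg\psi).$$

From the coherence $p(\bot) = 0$, it is deduced $p(\phi) = p(\phi \wedge \psi) + p(\phi \wedge \neg\psi)$. Since $p$ is non-negatively valued, $p(\phi) \ge p(\phi \wedge \psi)$.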
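
As a hedged illustration of the definition and of the increaseness property, the sketch below checks the axioms on a small finite model. The model itself is an assumption that does not appear in the text: propositions are encoded as sets of "worlds" (truth assignments of two atoms), and the world weights are arbitrary values chosen for the example.

```python
from fractions import Fraction

# Four worlds: truth values of two atomic propositions (phi, psi).
# The weights are arbitrary illustrative values summing to 1.
weights = {
    (True, True): Fraction(1, 8),
    (True, False): Fraction(3, 8),
    (False, True): Fraction(1, 4),
    (False, False): Fraction(1, 4),
}
worlds = frozenset(weights)


def p(prop):
    """Probability of a proposition encoded as the set of worlds where it holds."""
    return sum((weights[w] for w in prop), Fraction(0))


phi = frozenset(w for w in worlds if w[0])
psi = frozenset(w for w in worlds if w[1])
bottom, top = frozenset(), worlds

additivity = p(phi & psi) + p(phi | psi) == p(phi) + p(psi)
coherence = p(bottom) == 0
finiteness = p(top) == 1
increaseness = p(phi & psi) <= p(phi)
print(additivity, coherence, finiteness, increaseness)
```

Exact fractions are used so that the equalities are tested without rounding error.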


























8.4.3 Partially defined probability 


In the sequel, knowledge is alternately described by partially known probabilities over a logical system. Typically, the probability $p$ will be known only for a subset of propositions $\ell \subset L$.


Partial probabilities have been investigated by other works [9], for the representation of partial knowledge. In these works, the probabilities are characterized by constraints. It is believed that this area has been insufficiently investigated. And although our presentation is essentially focused on the logical aspect of the knowledge representation, it should be noticed that it is quite related to this notion of partial probability. In particular, the knowledge of the probability for a subset of propositions implies the definition of constraints for the probability over the whole logical system. For example, the knowledge of $\pi = p(\phi \wedge \psi)$ implies a lower bound for $p(\phi)$ and $p(\psi)$: $p(\phi) \ge \pi$ and $p(\psi) \ge \pi$.


The next section introduces, on a small example, a new interpretation of DSmT by means of probabilized logic.

8.5 Logical interpretation of DSmT: an example 


A bipropositional DSm model $\Delta = (\{\phi_1, \phi_2\}, m)$ is considered. This section proposes an interpretation of this DSm model by means of probabilized modal propositions.


8.5.1 A possible modal interpretation 


Consider the following modal propositions:

$U$. Unable to decide between the $\phi_i$'s,
$\alpha_i$. Proposition $\phi_i$ is sure; No Other Information (NOI),
$I$. Contradiction between the $\phi_i$'s.

It is noticeable that these propositions are exclusive:

$$\forall a, b \in \{U, \alpha_1, \alpha_2, I\}, \quad a \ne b \Rightarrow a \wedge b = \bot. \qquad (8.5)$$

These propositions are clearly related to the propositions $\phi_i$:

$$\begin{array}{ll}
I \le \phi_1 \wedge \phi_2,\ \phi_1,\ \phi_2,\ \phi_1 \vee \phi_2 & \text{[the contradiction } I \text{ implies everything]} \\
\alpha_i \le \phi_i,\ \phi_1 \vee \phi_2, \text{ for } i = 1, 2 & \text{[} \alpha_i \text{ implies } \phi_i \text{ and } \phi_1 \vee \phi_2 \text{]} \\
U \le \phi_1 \vee \phi_2 & \text{[} U \text{ only implies } \phi_1 \vee \phi_2 \text{]}
\end{array} \qquad (8.6)$$

These propositions are also exhaustive; i.e. in the universe $\Phi$, either one of the propositions $I$, $\alpha_1$, $\alpha_2$, $U$ should be verified:

$$I \vee \alpha_1 \vee \alpha_2 \vee U = \phi_1 \vee \phi_2. \qquad (8.7)$$

Since the propositions $\alpha_i$, $U$, $I$ are characterizing the knowledge about $\phi_i$ (with NOI), the doubt or the contradiction, it seems natural to associate to these propositions a belief equivalent to $m(\phi_i)$, $m(\phi_1 \vee \phi_2)$ and $m(\phi_1 \wedge \phi_2)$. These beliefs will be interpreted as probabilities over $I$, $U$ and $\alpha_i$:

$$p(I) = m(\phi_1 \wedge \phi_2), \quad p(U) = m(\phi_1 \vee \phi_2), \quad p(\alpha_i) = m(\phi_i), \text{ for } i = 1, 2. \qquad (8.8)$$

Such probabilistic interpretation is natural but questionable: it mixes probabilities together with bba. Since the propositions $\phi_i$ are not directly manipulated, this interpretation is not forbidden however. In fact, it will be shown next that this interpretation implies the fusion rule $\oplus$ and this will be a posterior justification of such hypothesis.


8.5.2 Deriving a fusion rule 


In this section, a fusion rule is deduced from the previous probabilized modal interpretation. This rule happens to be the (conjunctive) fusion rule of DSmT.

Let $\Delta_j = (\{\phi_1, \phi_2\}, m_j)$ be the DSm models associated to sensors $j = 1, 2$ working beside the same abstract universe $\{\phi_1, \phi_2\}$. Define the set of modal propositions $S_j = \{I_j, \alpha_{j1}, \alpha_{j2}, U_j\}$:

$U_j$. Unable to decide between the $\phi_i$'s, according to sensor $j$,
$\alpha_{ji}$. Proposition $\phi_i$ is sure and NOI, according to sensor $j$,
$I_j$. Contradiction between the $\phi_i$'s, according to sensor $j$.

The propositions of $S_j$ verify of course the properties (8.5), (8.6), (8.7) and (8.8), the subscript $j$ being added when needed. Define:

$$S = S_1 \wedge S_2 = \{a_1 \wedge a_2 \,/\, a_1 \in S_1 \text{ and } a_2 \in S_2\}.$$

Consider $a = a_1 \wedge a_2$ and $b = b_1 \wedge b_2$, two distinct elements of $S$. Then, either $a_1 \ne b_1$ or $a_2 \ne b_2$. Since $S_j$ verifies (8.5), it follows $a_1 \wedge b_1 = \bot$ or $a_2 \wedge b_2 = \bot$, thus yielding:

$$(a_1 \wedge a_2) \wedge (b_1 \wedge b_2) = (a_1 \wedge b_1) \wedge (a_2 \wedge b_2) = \bot.$$

$S$ is made of exclusive elements. It is also known from (8.7) that $\phi_1 \vee \phi_2 = \bigvee_{a_j \in S_j} a_j$; $S_j$ is exhaustive. It follows:

$$\phi_1 \vee \phi_2 = (\phi_1 \vee \phi_2) \wedge (\phi_1 \vee \phi_2) = \bigwedge_{j=1}^{2} \bigvee_{a_j \in S_j} a_j = \bigvee_{a \in S} a.$$


$S$ is exhaustive. In fact, $S$ enumerates all the possible cases of observation. It is thus reasonable to think that the fused knowledge of these sensors could be constructed from $S$. The question then arising is: what is the meaning of a proposition $a_1 \wedge a_2 \in S$? Recall that a proposition of $S_j$ just tells what is known for sure according to sensor $j$. But the semantic for combining sure or unsure propositions is quite natural¹:

• unsure + unsure = unsure
• unsure + sure = sure
• sure + sure = sure OR contradiction
• anything + contradiction = contradiction

In particular, contradiction arises when two pieces of information are sure and are known to be contradictory. This leads to a general interpretation of $S$:


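$\wedge$      | $I_2$         | $\alpha_{21}$    | $\alpha_{22}$    | $U_2$
$I_1$         | Contradiction | Contradiction    | Contradiction    | Contradiction
$\alpha_{11}$ | Contradiction | $\phi_1$ is sure | Contradiction    | $\phi_1$ is sure
$\alpha_{12}$ | Contradiction | Contradiction    | $\phi_2$ is sure | $\phi_2$ is sure
$U_1$         | Contradiction | $\phi_1$ is sure | $\phi_2$ is sure | Unsure
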
At last, any proposition of $S$ is a sub-event of a proposition $I$, $\alpha_1$, $\alpha_2$ or $U$, defined by:

$U$. The sensors are unable to decide between the $\phi_i$'s,
$\alpha_i$. The sensors are sure of the proposition $\phi_i$, but do not know anything else,
$I$. The sensors contradict.


¹ In fact, the independence of the sensors is implicitly hypothesized in such combining rule (refer to next section).
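Since $S$ is exhaustive, the propositions $U$, $\alpha_i$, $I$ are entirely determined by $S$:

• $I = (I_1 \wedge I_2) \vee (I_1 \wedge \alpha_{21}) \vee (I_1 \wedge \alpha_{22}) \vee (I_1 \wedge U_2) \vee (\alpha_{11} \wedge I_2) \vee (\alpha_{12} \wedge I_2) \vee (U_1 \wedge I_2) \vee (\alpha_{12} \wedge \alpha_{21}) \vee (\alpha_{11} \wedge \alpha_{22})$,
• $\alpha_i = (\alpha_{1i} \wedge \alpha_{2i}) \vee (U_1 \wedge \alpha_{2i}) \vee (\alpha_{1i} \wedge U_2)$,
• $U = U_1 \wedge U_2$.

The propositions $I$, $\alpha_i$, $U$ are thus entirely specified and since $S$ is made of exclusive elements, their probabilities are given by:

• $p(I) = p(I_1 \wedge I_2) + p(I_1 \wedge \alpha_{21}) + p(I_1 \wedge \alpha_{22}) + p(I_1 \wedge U_2) + \cdots + p(\alpha_{11} \wedge \alpha_{22})$,
• $p(\alpha_i) = p(\alpha_{1i} \wedge \alpha_{2i}) + p(U_1 \wedge \alpha_{2i}) + p(\alpha_{1i} \wedge U_2)$,
• $p(U) = p(U_1 \wedge U_2)$.

At this point, the independence of the sensors is needed. The hypothesis implies $p(a_1 \wedge a_2) = p(a_1)\, p(a_2)$. The constraints (8.8) for each sensor $j$ then yield:

• $p(I) = m_1(\phi_1 \wedge \phi_2)\, m_2(\phi_1 \wedge \phi_2) + m_1(\phi_1 \wedge \phi_2)\, m_2(\phi_1) + \cdots + m_1(\phi_1)\, m_2(\phi_2)$,
• $p(\alpha_i) = m_1(\phi_i)\, m_2(\phi_i) + m_1(\phi_1 \vee \phi_2)\, m_2(\phi_i) + m_1(\phi_i)\, m_2(\phi_1 \vee \phi_2)$,
• $p(U) = m_1(\phi_1 \vee \phi_2)\, m_2(\phi_1 \vee \phi_2)$.

The definition of $m_1 \oplus m_2$ implies finally:

$$p(I) = m_1 \oplus m_2(\phi_1 \wedge \phi_2), \quad p(\alpha_i) = m_1 \oplus m_2(\phi_i), \quad\text{and}\quad p(U) = m_1 \oplus m_2(\phi_1 \vee \phi_2).$$

Our interpretation of DSmT by means of probabilized modal propositions has implied the fusion rule $\oplus$.

This result is investigated rigorously and generally in the next section.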
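
The derivation of this section can be replayed numerically. The sketch below is only an illustration under stated assumptions: the labels `"I"`, `"a1"`, `"a2"`, `"U"` and the functions `combine` and `fused_probabilities` are hypothetical names, and the input probabilities reuse the thief example of section 8.3.2.1 with masses 0.99 and 0.01 (the values that reproduce the fused results 0.9801, 0.0099 and 0.0001 quoted there).

```python
from itertools import product


def combine(x, y):
    """Interpretation table of S for one proposition of each sensor."""
    if "I" in (x, y):
        return "I"               # anything + contradiction = contradiction
    if x == "U":
        return y                 # unsure + y = y
    if y == "U":
        return x                 # x + unsure = x
    return x if x == y else "I"  # sure + sure = sure, or contradiction if they differ


def fused_probabilities(p1, p2):
    """Sum p(a1 AND a2) = p(a1) * p(a2) (sensor independence) over each cell of the table."""
    out = {"I": 0.0, "a1": 0.0, "a2": 0.0, "U": 0.0}
    for (x, px), (y, py) in product(p1.items(), p2.items()):
        out[combine(x, y)] += px * py
    return out


# Thief example: phi_1 = young, phi_2 = old.
p1 = {"I": 0.0, "a1": 0.99, "a2": 0.0, "U": 0.01}  # granddad
p2 = {"I": 0.0, "a1": 0.0, "a2": 0.99, "U": 0.01}  # grandson
fused = fused_probabilities(p1, p2)
print({k: round(v, 4) for k, v in fused.items()})
```

The probabilities obtained for $I$, $\alpha_1$, $\alpha_2$ and $U$ coincide with the masses that the rule (8.4) assigns to $\phi_1 \wedge \phi_2$, $\phi_1$, $\phi_2$ and $\phi_1 \vee \phi_2$.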


8.6 Multi-modal logic and information fusion 


This section generalizes the results of the previous section. The presentation is more formalized. In 
particular, a multi-modal logic for the information fusion is constructed. This presentation is not fully 


detailed and it is assumed that the reader is acquainted with some logical notions. 


8.6.1 Modal logic 


In this introductory section, we are just interested in modal logic, and particularly in the T-system. There is no need to argue about a better system, since we are only interested in manipulating the modalities $\Box$, $\Diamond$ and $\neg\Box$.



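Being given $\Phi$ a set of atomic propositions, the set of classical propositions, $C(\Phi)$ more simply denoted $C$, is defined by:

• $\Phi \subset C$, $\bot \in C$ and $\top \in C$,
• If $\phi, \psi \in C$, then $\neg\phi \in C$, $\phi \wedge \psi \in C$, $\phi \vee \psi \in C$ and $\phi \to \psi \in C$.

The set of modal propositions, $M(\Phi)$ also denoted $M$, is constructed as follows:

• $C \subset M$,
• If $\phi \in M$, then $\Box\phi \in M$ and $\Diamond\phi \in M$,
• If $\phi, \psi \in M$, then $\neg\phi \in M$, $\phi \wedge \psi \in M$, $\phi \vee \psi \in M$ and $\phi \to \psi \in M$.

The proposition $\Box\phi$ will mean that the proposition $\phi$ is true for sure. The proposition $\Diamond\phi$ will mean that the proposition $\phi$ is possibly true.

In the sequel, the notation $\vdash \phi$ means that $\phi$ is proved in T. A proposition $\phi$ such that $\vdash \phi$ is also called an axiom. The notation $\phi \equiv \psi$ means both $\vdash \phi \to \psi$ and $\vdash \psi \to \phi$.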


All axioms are defined recursively by assuming some deduction rules and initial axioms.

Modus Ponens (MP). For any propositions $\phi, \psi \in M$ such that $\vdash \phi$ and $\vdash \phi \to \psi$, it is deduced $\vdash \psi$.
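Classical axioms. For any $\phi, \psi, \eta \in M$, it is assumed the axioms:

1. $\vdash \top$,
2. $\vdash \phi \to (\psi \to \phi)$,
3. $\vdash (\eta \to (\phi \to \psi)) \to ((\eta \to \phi) \to (\eta \to \psi))$,
4. $\vdash (\neg\phi \to \neg\psi) \to ((\neg\phi \to \psi) \to \phi)$,
5. $\bot \equiv \neg\top$,
6. $\phi \to \psi \equiv \neg\phi \vee \psi$,
7. $\phi \wedge \psi \equiv \neg(\neg\phi \vee \neg\psi)$.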


It is deduced from these axioms that:

• The relation $\vdash \phi \to \psi$ is a pre-order with a minimum $\bot$ and a maximum $\top$: $\bot$ is the strongest proposition, $\top$ is the weakest proposition,
• The relation $\equiv$ is an equivalence relation.


Modal axioms and rule. Let $\phi, \psi \in M$.

i. From $\vdash \phi$ is deduced $\vdash \Box\phi$; axioms are sure. This does not mean $\vdash \phi \to \Box\phi$, which is false!

ii. $\vdash \Box(\phi \to \psi) \to (\Box\phi \to \Box\psi)$; when the inference is sure and the premise is sure, the conclusion is sure,

iii. $\vdash \Box\phi \to \phi$; sure propositions are true,

iv. $\Diamond\phi \equiv \neg\Box\neg\phi$; what cannot be false for sure is possibly true.

It is deduced that the proposition $\Box\phi$ is stronger than $\phi$ which is stronger than $\Diamond\phi$.





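Notation. In the sequel, $\psi \le \phi$ means $\vdash \psi \to \phi$, and $\psi < \phi$ means both $\psi \le \phi$ and $\psi \not\equiv \phi$.

The logical operators are compatible with $\equiv$. Denote $\phi/\!\equiv\ = \{\psi \in M \,/\, \psi \equiv \phi\}$, the class of equivalence of $\phi$. Let $\phi, \psi \in M$, $\hat\phi \in \phi/\!\equiv$ and $\hat\psi \in \psi/\!\equiv$. Then holds:

• $\hat\phi \to \hat\psi \in (\phi \to \psi)/\!\equiv$,
• $\neg\hat\phi \in (\neg\phi)/\!\equiv$,
• $\hat\phi \wedge \hat\psi \in (\phi \wedge \psi)/\!\equiv$,
• $\hat\phi \vee \hat\psi \in (\phi \vee \psi)/\!\equiv$,
• $\Box\hat\phi \in (\Box\phi)/\!\equiv$,
• $\Diamond\hat\phi \in (\Diamond\phi)/\!\equiv$.

The logical operators over $M$ are thus extended naturally to the classes of $M$ by setting:

• $\phi/\!\equiv\ \to\ \psi/\!\equiv\ \triangleq (\phi \to \psi)/\!\equiv$,
• $\neg(\phi/\!\equiv) \triangleq (\neg\phi)/\!\equiv$,
• $\phi/\!\equiv\ \wedge\ \psi/\!\equiv\ \triangleq (\phi \wedge \psi)/\!\equiv$,
• $\phi/\!\equiv\ \vee\ \psi/\!\equiv\ \triangleq (\phi \vee \psi)/\!\equiv$,
• $\Box(\phi/\!\equiv) \triangleq (\Box\phi)/\!\equiv$,
• $\Diamond(\phi/\!\equiv) \triangleq (\Diamond\phi)/\!\equiv$.

From now on, the class $\phi/\!\equiv$ is simply denoted $\phi$.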


Hyper-power set. Construct the subset of classical propositions $F(\Phi)$ recursively by the properties $\Phi \subset F(\Phi)$ and $\forall \phi, \psi \in F(\Phi)$, $[\phi \wedge \psi \in F(\Phi)$ and $\phi \vee \psi \in F(\Phi)]$. The hyper-power set of $\Phi$, denoted $<\Phi>$, is the set of equivalence classes of $F(\Phi)$ according to the relation $\equiv$:

$$<\Phi>\ = F(\Phi)/\!\equiv\ = \{\phi/\!\equiv \ /\ \phi \in F(\Phi)\}.$$
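
The recursive construction of the hyper-power set can be sketched as a closure computation. The encoding below is an assumption made for illustration, not the text's formalism: each equivalence class is represented by the set of Venn regions it covers, so that $\wedge$ and $\vee$ become set intersection and union, and two propositions are equivalent exactly when their region sets are equal. The function name `hyper_power_set` is hypothetical.

```python
from itertools import combinations


def hyper_power_set(n):
    """Close the n atoms under conjunction and disjunction, up to logical equivalence."""
    indices = range(n)
    # A Venn region is identified by the non-empty set of atoms that are true in it.
    regions = [frozenset(c) for r in range(1, n + 1) for c in combinations(indices, r)]
    atoms = [frozenset(reg for reg in regions if i in reg) for i in indices]

    closed = set(atoms)
    frontier = list(atoms)
    while frontier:
        x = frontier.pop()
        for y in list(closed):
            for z in (x & y, x | y):  # conjunction and disjunction
                if z not in closed:
                    closed.add(z)
                    frontier.append(z)
    return closed


for n in (1, 2, 3):
    print(n, len(hyper_power_set(n)))
```

For two atoms the closure contains the four classes $\phi_1 \wedge \phi_2$, $\phi_1$, $\phi_2$ and $\phi_1 \vee \phi_2$; for three atoms it contains 18 classes. The size grows very quickly with the number of atoms, so this brute-force enumeration is only practical for small universes.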





8.6.1.1 Useful theorems 


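Let $\phi, \psi \in M$.

1. $\vdash (\Box\phi \wedge \Box\psi) \to \Box(\phi \wedge \psi)$ and $\vdash \Box(\phi \wedge \psi) \to (\Box\phi \wedge \Box\psi)$

2. $\vdash (\Diamond\phi \vee \Diamond\psi) \to \Diamond(\phi \vee \psi)$ and $\vdash \Diamond(\phi \vee \psi) \to (\Diamond\phi \vee \Diamond\psi)$

3. $\vdash (\Box\phi \vee \Box\psi) \to \Box(\phi \vee \psi)$ but $\nvdash \Box(\phi \vee \psi) \to (\Box\phi \vee \Box\psi)$

4. $\vdash \Diamond(\phi \wedge \psi) \to (\Diamond\phi \wedge \Diamond\psi)$ but $\nvdash (\Diamond\phi \wedge \Diamond\psi) \to \Diamond(\phi \wedge \psi)$


Proof. Theorem 1 and theorem 2 are dual and thus equivalent (rules 7 and iv.). It is exactly the same thing for theorem 3 and theorem 4.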
Proof of $\vdash (\Box\phi \wedge \Box\psi) \to \Box(\phi \wedge \psi)$.

Classical rules yield the axiom:

$$\vdash \phi \to (\psi \to (\phi \wedge \psi))$$

Rule i. implies then:

$$\vdash \Box\big(\phi \to (\psi \to (\phi \wedge \psi))\big)$$

Applying rule ii. twice, it is deduced:

$$\vdash \Box\phi \to \Box(\psi \to (\phi \wedge \psi))$$

$$\vdash \Box\phi \to (\Box\psi \to \Box(\phi \wedge \psi))$$

The proof is concluded by applying the classical rules.
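Proof of $\vdash \Box(\phi \wedge \psi) \to (\Box\phi \wedge \Box\psi)$.

Classical rules yield the axioms:

$$\vdash (\phi \wedge \psi) \to \phi \quad\text{and}\quad \vdash (\phi \wedge \psi) \to \psi$$

Rule i. implies then:

$$\vdash \Box((\phi \wedge \psi) \to \phi) \quad\text{and}\quad \vdash \Box((\phi \wedge \psi) \to \psi)$$

Applying rule ii., it is deduced:

$$\vdash \Box(\phi \wedge \psi) \to \Box\phi \quad\text{and}\quad \vdash \Box(\phi \wedge \psi) \to \Box\psi$$

The proof is concluded by applying the classical rules.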











































































































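Proof of $\vdash (\Box\phi \vee \Box\psi) \to \Box(\phi \vee \psi)$.

Classical rules yield the axioms:

$$\vdash \phi \to (\phi \vee \psi) \quad\text{and}\quad \vdash \psi \to (\phi \vee \psi)$$

Rule i. implies then:

$$\vdash \Box(\phi \to (\phi \vee \psi)) \quad\text{and}\quad \vdash \Box(\psi \to (\phi \vee \psi))$$

Applying rule ii., it is deduced:

$$\vdash \Box\phi \to \Box(\phi \vee \psi) \quad\text{and}\quad \vdash \Box\psi \to \Box(\phi \vee \psi)$$

The proof is concluded by applying the classical rules.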


Why $\nvdash \Box(\phi \vee \psi) \to (\Box\phi \vee \Box\psi)$?

To answer this question precisely, the Kripke semantic should be introduced. Such discussion is outside the scope of this paper. However, some practical considerations will clarify this assertion. When $\phi \vee \psi$ is sure, does that mean that $\phi$ is sure or $\psi$ is sure? Not really, since we know that $\phi$ or $\psi$ is true, but we do not know which one is true. Moreover, it may happen that $\phi$ is true sometimes, while $\psi$ is true the other times. As a conclusion, we are not sure of $\phi$ and are not sure of $\psi$. This example is a counter-example of $\vdash \Box(\phi \vee \psi) \to (\Box\phi \vee \Box\psi)$.
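
Although the Kripke semantic is not developed in the text, the informal argument above can be made concrete on a tiny model. The sketch below is an assumption-laden illustration: the two worlds, the accessibility relation and the helper `box` are all hypothetical, and the frame is chosen reflexive so that rule iii. of the T-system holds.

```python
# Two worlds that each see themselves and the other one (a reflexive frame).
worlds = {"w1", "w2"}
access = {"w1": {"w1", "w2"}, "w2": {"w1", "w2"}}

phi = {"w1"}  # phi is true only in w1
psi = {"w2"}  # psi is true only in w2


def box(prop):
    """Worlds where prop is sure, i.e. where prop holds in every accessible world."""
    return {w for w in worlds if access[w] <= prop}


sure_disjunction = box(phi | psi)       # box(phi or psi)
disjunction_of_sure = box(phi) | box(psi)  # box(phi) or box(psi)

print(sorted(sure_disjunction), sorted(disjunction_of_sure))
```

In this model $\Box(\phi \vee \psi)$ holds in every world while $\Box\phi \vee \Box\psi$ holds in none, so the implication from the former to the latter fails, whereas the converse implication of theorem 3 is satisfied.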


































8.6.2 A multi-modal logic 


Assume that several pieces of information are obtained from different sources. Typically, these pieces of information are modalities such as "according to the source $\sigma$, the proposition $\phi$ is sure". Such a modality could be naturally denoted $\Box_\sigma \phi$ (a modality depending on $\sigma$). A more readable notation $[\phi|\sigma] \triangleq \Box_\sigma \phi$ is preferred. Take note that $[\phi|\sigma]$ is not related to the Bayes inference $(\phi|\sigma)$! Now, the question arising is how to combine these modalities. For example, is it possible to deduce something from $[\phi_1|\sigma_1] \wedge [\phi_2|\sigma_2]$? Without any relations between heterogeneous modalities, it is not possible to answer this question. Such a relation, however, is foreseeable. Assume that the source $\tau$ involves the source $\sigma$, i.e. $\tau \to \sigma$. Now assume that the proposition $\phi$ should be known from the source $\sigma$, i.e. $[\phi|\sigma]$. Since $\tau$ involves $\sigma$, it is natural to state that $\phi$ should be known from the source $\tau$, i.e. $[\phi|\tau]$. This natural deduction could be formalized by the rule:

$$\vdash \tau \to \sigma \quad\text{implies}\quad \vdash [\phi|\sigma] \to [\phi|\tau].$$

With this rule, it is now possible to define the logic.


The set of multi-modal propositions, $mM(\Phi)$ also denoted $mM$, is defined recursively:

• $C \subset mM$,
• If $\phi, \sigma \in mM$, then $[\phi|\sigma] \in mM$,
• If $\phi, \psi \in mM$, then $\neg\phi \in mM$, $\phi \wedge \psi \in mM$, $\phi \vee \psi \in mM$ and $\phi \to \psi \in mM$.

The multi-modal logic obeys the following rules and axioms:

Modus Ponens.

Classical axioms. Axioms 1 to 7,

Modal axioms and rule. Let $\sigma, \tau, \phi, \psi \in mM$.

m.i. From $\vdash \phi$ is deduced $\vdash [\phi|\sigma]$: axioms are sure, according to any sources,

m.ii. $\vdash [\phi \to \psi|\sigma] \to ([\phi|\sigma] \to [\psi|\sigma])$. If a source of information asserts a proposition and recognizes a weaker proposition, then it asserts this weaker proposition,

m.iii. $\vdash [\phi|\sigma] \to \phi$. The sources of information always tell the truth. If a source asserts a proposition, this proposition is actually true,

m.iv. $\vdash \tau \to \sigma$ implies $\vdash [\phi|\sigma] \to [\phi|\tau]$. Knowledge increases with stronger sources of information.


The axiom m.iii. is questionable and may be changed. But the work presented in this paper is restricted to this axiom.

It is also possible to consider some exotic rules like $\phi \equiv [\phi|\bot]$, i.e. a perfect source of information $\bot$ yields a perfect knowledge of the propositions $\phi$. Similarly, the modality $[\phi|\top]$ could be interpreted as the proposition "$\phi$ is an absolute truth" or "$\phi$ has a proof": one does not need any source of information to assert an absolute truth...


8.6.3 Some multi-modal theorems 
8.6.3.1 Modal theorems map into multi-modal logic 


Let $\mu \in M$ be a modal proposition. Let $\sigma \in mM$ be a multi-modal proposition. Let $\mu[\sigma] \in mM$ be the multi-modal proposition obtained by replacing $\Box$ by $[\,\cdot\,|\sigma]$ and $\Diamond$ by $\neg[\neg\,\cdot\,|\sigma]$ in the proposition $\mu$. Then $\vdash \mu$ implies $\vdash \mu[\sigma]$.
8.6.3.2 Useful multi-modal theorems 
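If the source $\sigma$ asserts $\phi$ and the source $\tau$ asserts $\psi$, then the fused sources assert $\phi \wedge \psi$:

$$\vdash ([\phi|\sigma] \wedge [\psi|\tau]) \to [\phi \wedge \psi|\sigma \wedge \tau].$$

Proof. From the axioms $\vdash (\sigma \wedge \tau) \to \sigma$ and $\vdash (\sigma \wedge \tau) \to \tau$, it is deduced:

$$\vdash [\phi|\sigma] \to [\phi|\sigma \wedge \tau],$$

and

$$\vdash [\psi|\tau] \to [\psi|\sigma \wedge \tau].$$

From the useful theorems proved for modal logic, it is deduced:

$$[\phi|\sigma \wedge \tau] \wedge [\psi|\sigma \wedge \tau] \equiv [\phi \wedge \psi|\sigma \wedge \tau].$$

The proof is concluded by applying the classical rules.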


























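If one of the sources $\sigma$ or $\tau$ asserts $\phi$, then the fused sources assert $\phi$:

$$\vdash ([\phi|\sigma] \vee [\phi|\tau]) \to [\phi|\sigma \wedge \tau].$$

Proof. This results directly from $\vdash [\phi|\sigma] \to [\phi|\sigma \wedge \tau]$ and $\vdash [\phi|\tau] \to [\phi|\sigma \wedge \tau]$.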


























The converse is not necessarily true:

$$\nvdash [\phi|\sigma \wedge \tau] \to ([\phi|\sigma] \vee [\phi|\tau]).$$

In fact, when sensors are not independent and possibly interactive, it is possible that the fused sensor $\sigma \wedge \tau$ works better than $\sigma$ and $\tau$ separately! On the other hand, this converse property could be considered as a necessary condition for the sensor independence. This discussion leads to the introduction of a new axiom, the independence axiom m.indep.:

m.indep. $\vdash [\phi|\sigma \wedge \tau] \to ([\phi|\sigma] \vee [\phi|\tau])$.


8.6.4 Sensor fusion 
8.6.4.1 The context 


Two sensors, $\sigma$ and $\tau$, are generating information about a set of atomic propositions $\Phi$. More precisely, the sensors will measure independently the probability for each proposition $\phi \in <\Phi>$ to be sure. This section discusses the fusion of these sources of information.

This problem is clearly embedded in the multi-modal logic formalism. In particular, the modality $[\,\cdot\,|\sigma]$ characterizes the knowledge of $\sigma$ about the universe $<\Phi>$. More precisely, the proposition $[\phi|\sigma]$ explains if $\phi$ is sure according to $\sigma$ or not. This knowledge is probabilistic: the working data are the probabilities $p([\phi|\sigma])$ and $p([\phi|\tau])$ for $\phi \in <\Phi>$. The problem setting is more formalized in the next section.

Notation. From now on, the notation $p[\phi|\sigma]$ is used instead of $p([\phi|\sigma])$. Beware that $p[\phi|\sigma]$ is not the conditional probability $p(\phi|\sigma)$!


8.6.4.2 Sensors model and problem setting 


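The set of multi-modal propositions, $mM(\Theta)$, is constructed from the set $\Theta = \Phi \cup \{\sigma, \tau\}$. The propositions $\sigma$ and $\tau$ are referring to the two independent sensors. The proposition $\sigma \wedge \tau$ is referring to the fused sensor. It is assumed for sure that $\bigvee_{\phi \in \Phi} \phi$ is true:

$$\vdash \bigvee_{\phi \in \Phi} \phi.$$

Consequently:

$$\Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \sigma \wedge \tau\Big] \equiv \Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \sigma\Big] \equiv \Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \tau\Big] \equiv \top,$$

and:

$$p\Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \sigma \wedge \tau\Big] = p\Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \sigma\Big] = p\Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \tau\Big] = 1.$$

The sensors $\sigma$ and $\tau$ are giving probabilistic information about the certainty of the other propositions. More precisely, it is known:

$$p[\phi|\sigma] \text{ and } p[\phi|\tau], \quad\text{for any } \phi \in <\Phi>.$$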


Since the propositions $\sigma$ and $\tau$ are referring to independent sensors, it is assumed that:

• The axiom m.indep. holds for $\sigma$ and $\tau$,
• For any $\phi, \psi \in <\Phi>$, $p([\phi|\sigma] \wedge [\psi|\tau]) = p[\phi|\sigma]\, p[\psi|\tau]$.

A pertinent fused information is expected:

How to compute $p[\phi|\sigma \wedge \tau]$ for any $\phi \in <\Phi>$?


8.6.4.3 Constructing the fused belief 


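Defining tools. The useful propositions $\phi^{(\sigma)}$ are defined for any $\phi \in <\Phi>$:

$$\phi^{(\sigma)} \triangleq [\phi|\sigma] \wedge \neg\Big(\bigvee_{\psi \in <\Phi> :\, \psi < \phi} [\psi|\sigma]\Big).$$

The same propositions $\phi^{(\tau)}$ are defined for $\tau$:

$$\phi^{(\tau)} \triangleq [\phi|\tau] \wedge \neg\Big(\bigvee_{\psi \in <\Phi> :\, \psi < \phi} [\psi|\tau]\Big).$$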


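Properties.

The propositions $\phi^{(\sigma)}$ are exclusive:

$$\phi^{(\sigma)} \wedge \psi^{(\sigma)} \equiv \bot, \quad\text{for any } \phi \ne \psi.$$

Proof. Since $[\phi|\sigma] \wedge [\psi|\sigma] \equiv [\phi \wedge \psi|\sigma]$, it is deduced:

$$\phi^{(\sigma)} \wedge \psi^{(\sigma)} \equiv [\phi \wedge \psi|\sigma] \wedge \neg\Big(\bigvee_{\eta :\, \eta < \phi} [\eta|\sigma]\Big) \wedge \neg\Big(\bigvee_{\eta :\, \eta < \psi} [\eta|\sigma]\Big).$$

It follows:

$$\phi^{(\sigma)} \wedge \psi^{(\sigma)} \equiv [\phi \wedge \psi|\sigma] \wedge \Big(\bigwedge_{\eta :\, \eta < \phi} \neg[\eta|\sigma]\Big) \wedge \Big(\bigwedge_{\eta :\, \eta < \psi} \neg[\eta|\sigma]\Big).$$

Since $\phi \wedge \psi < \phi$ or $\phi \wedge \psi < \psi$ when $\phi \ne \psi$, the property is deduced.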


























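Lemma:

$$\bigvee_{\psi < \phi} [\psi|\sigma] \le [\phi|\sigma].$$

Proof. The property $\psi \le \phi$ implies successively $\vdash \psi \to \phi$, $\vdash [\psi \to \phi|\sigma]$ and $\vdash [\psi|\sigma] \to [\phi|\sigma]$. The lemma is then deduced.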


























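The propositions $\phi^{(\sigma)}$ are exhaustive:

$$\bigvee_{\psi :\, \psi \le \phi} \psi^{(\sigma)} \equiv [\phi|\sigma], \quad\text{and in particular:}\quad \bigvee_{\phi \in <\Phi>} \phi^{(\sigma)} \equiv \top.$$

Proof. The proof is recursive. First, the smallest element of $<\Phi>$ is $\mu \triangleq \bigwedge_{\phi \in \Phi} \phi$ and verifies:

$$\mu^{(\sigma)} \equiv [\mu|\sigma].$$

Secondly:

$$\bigvee_{\psi :\, \psi \le \phi} \psi^{(\sigma)} \equiv \phi^{(\sigma)} \vee \Big(\bigvee_{\psi :\, \psi < \phi} \psi^{(\sigma)}\Big) \equiv \phi^{(\sigma)} \vee \Big(\bigvee_{\psi :\, \psi < \phi}\ \bigvee_{\eta :\, \eta \le \psi} \eta^{(\sigma)}\Big) \equiv \phi^{(\sigma)} \vee \Big(\bigvee_{\psi :\, \psi < \phi} [\psi|\sigma]\Big).$$

Since $\phi^{(\sigma)} \equiv [\phi|\sigma] \wedge \neg\big(\bigvee_{\psi :\, \psi < \phi} [\psi|\sigma]\big)$ and $\bigvee_{\psi :\, \psi < \phi} [\psi|\sigma] \le [\phi|\sigma]$, it follows:

$$\bigvee_{\psi :\, \psi \le \phi} \psi^{(\sigma)} \equiv [\phi|\sigma].$$

The second part of the property results from:

$$\bigvee_{\phi \in <\Phi>} \phi^{(\sigma)} \equiv \bigvee_{\psi :\, \psi \le \bigvee_{\phi \in \Phi} \phi} \psi^{(\sigma)} \quad\text{and}\quad \Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \sigma\Big] \equiv \top.$$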
































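The propositions $\phi^{(\tau)}$ are also exclusive and exhaustive:

$$\phi^{(\tau)} \wedge \psi^{(\tau)} \equiv \bot, \quad\text{for any } \phi \ne \psi,$$

and:

$$\bigvee_{\phi \in <\Phi>} \phi^{(\tau)} \equiv \top.$$

It is then deduced that the propositions $\phi^{(\sigma)} \wedge \psi^{(\tau)}$ are exclusive and exhaustive²:

$$\forall \phi_1, \psi_1, \phi_2, \psi_2 \in <\Phi>, \quad (\phi_1, \psi_1) \ne (\phi_2, \psi_2) \Rightarrow \big(\phi_1^{(\sigma)} \wedge \psi_1^{(\tau)}\big) \wedge \big(\phi_2^{(\sigma)} \wedge \psi_2^{(\tau)}\big) \equiv \bot, \qquad (8.9)$$

and:

$$\bigvee_{\phi, \psi \in <\Phi>} \big(\phi^{(\sigma)} \wedge \psi^{(\tau)}\big) \equiv \top. \qquad (8.10)$$

From the properties (8.9) and (8.10), it results that the set:

$$\Sigma = \big\{\phi^{(\sigma)} \wedge \psi^{(\tau)} \ /\ \phi, \psi \in <\Phi>\big\}$$

is a partition of $\top$. This property is particularly interesting, since it makes possible the computation of the probability of any proposition factorized within $\Sigma$:

$$\forall \Lambda \subset \Sigma, \quad p\Big(\bigvee_{\phi \in \Lambda} \phi\Big) = \sum_{\phi \in \Lambda} p(\phi). \qquad (8.11)$$


² The notation $(\phi_1, \psi_1) = (\phi_2, \psi_2)$ means $\phi_1 = \phi_2$ and $\psi_1 = \psi_2$.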
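Factorizing $\phi^{(\sigma \wedge \tau)}$. It has been shown that $\vdash ([\phi|\sigma] \wedge [\psi|\tau]) \to [\phi \wedge \psi|\sigma \wedge \tau]$. It follows:

$$\vdash \bigvee_{\phi \wedge \psi \le \eta} ([\phi|\sigma] \wedge [\psi|\tau]) \to [\eta|\sigma \wedge \tau]. \qquad (8.12)$$

The axiom m.indep. says $\vdash [\eta|\sigma \wedge \tau] \to ([\eta|\sigma] \vee [\eta|\tau])$. Since $[\bigvee_{\phi \in \Phi} \phi|\sigma] \equiv [\bigvee_{\phi \in \Phi} \phi|\tau] \equiv \top$, it is deduced:

$$\vdash [\eta|\sigma \wedge \tau] \to \Big(\Big([\eta|\sigma] \wedge \Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \tau\Big]\Big) \vee \Big(\Big[\bigvee_{\phi \in \Phi} \phi \,\Big|\, \sigma\Big] \wedge [\eta|\tau]\Big)\Big).$$

At last:

$$[\eta|\sigma \wedge \tau] \equiv \bigvee_{\phi \wedge \psi \le \eta} ([\phi|\sigma] \wedge [\psi|\tau]). \qquad (8.13)$$

It is then deduced:

$$\phi^{(\sigma \wedge \tau)} \triangleq [\phi|\sigma \wedge \tau] \wedge \neg\Big(\bigvee_{\psi < \phi} [\psi|\sigma \wedge \tau]\Big) \equiv \Big(\bigvee_{\eta \wedge \zeta \le \phi} ([\eta|\sigma] \wedge [\zeta|\tau])\Big) \wedge \neg\Big(\bigvee_{\psi < \phi}\ \bigvee_{\eta \wedge \zeta \le \psi} ([\eta|\sigma] \wedge [\zeta|\tau])\Big).$$

Now:

$$\bigvee_{\eta \wedge \zeta \le \phi} ([\eta|\sigma] \wedge [\zeta|\tau]) \equiv \bigvee_{\eta \wedge \zeta \le \phi} \Big(\Big(\bigvee_{\eta' \le \eta} \eta'^{(\sigma)}\Big) \wedge \Big(\bigvee_{\zeta' \le \zeta} \zeta'^{(\tau)}\Big)\Big) \equiv \bigvee_{\eta \wedge \zeta \le \phi} \big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big).$$

At last:

$$\phi^{(\sigma \wedge \tau)} \equiv \Big(\bigvee_{\eta \wedge \zeta \le \phi} \big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big)\Big) \wedge \neg\Big(\bigvee_{\psi < \phi}\ \bigvee_{\eta \wedge \zeta \le \psi} \big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big)\Big) \equiv \Big(\bigvee_{\eta \wedge \zeta \le \phi} \big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big)\Big) \wedge \neg\Big(\bigvee_{\eta \wedge \zeta < \phi} \big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big)\Big).$$

Since $\Sigma$ is a partition, it is deduced the final results:

$$\phi^{(\sigma \wedge \tau)} \equiv \bigvee_{\eta \wedge \zeta = \phi} \big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big), \qquad (8.14)$$

and:

$$p\big(\phi^{(\sigma \wedge \tau)}\big) = \sum_{\eta \wedge \zeta = \phi} p\big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big). \qquad (8.15)$$

This last result is sufficient to derive $p[\phi|\sigma \wedge \tau]$, as soon as we are able to compute the probability over $\Sigma$. The next paragraph explains this computation.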
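The probability over $\Sigma$.

Computing $p(\phi^{(\sigma)})$ and $p(\phi^{(\tau)})$. These probabilities are computed recursively from $p[\phi|\sigma]$ and $p[\phi|\tau]$. More precisely, it is derived from the definition $\phi^{(\sigma)} = [\phi|\sigma] \wedge \neg\big(\bigvee_{\psi < \phi} [\psi|\sigma]\big)$ and the property $[\phi|\sigma] \equiv \bigvee_{\psi \le \phi} \psi^{(\sigma)}$ that:

$$\phi^{(\sigma)} \equiv [\phi|\sigma] \wedge \neg\Big(\bigvee_{\psi < \phi} \psi^{(\sigma)}\Big).$$

Since the propositions $\phi^{(\sigma)}$ are exclusive and $\bigvee_{\psi < \phi} \psi^{(\sigma)} \le [\phi|\sigma]$, it follows:

$$p\big(\phi^{(\sigma)}\big) = p[\phi|\sigma] - \sum_{\psi < \phi} p\big(\psi^{(\sigma)}\big). \qquad (8.16)$$

This equation, related to the Moebius transform, is sufficient for computing $p(\phi^{(\sigma)})$ recursively.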


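Deriving $p(\phi^{(\sigma \wedge \tau)})$. First, it is shown recursively that:

$$p\big(\phi^{(\sigma)} \wedge \psi^{(\tau)}\big) = p\big(\phi^{(\sigma)}\big)\, p\big(\psi^{(\tau)}\big). \qquad (8.17)$$

Proof - step 1. For the smallest element $\mu = \bigwedge_{\phi \in \Phi} \phi$, it happens that $\mu^{(\sigma)} \equiv [\mu|\sigma]$ and $\mu^{(\tau)} \equiv [\mu|\tau]$. Since $[\mu|\sigma]$ and $[\mu|\tau]$ are independent propositions, it follows then $p(\mu^{(\sigma)} \wedge \mu^{(\tau)}) = p(\mu^{(\sigma)})\, p(\mu^{(\tau)})$.

Proof - step 2. Being given $\phi, \psi \in <\Phi>$, assume $p(\eta^{(\sigma)} \wedge \zeta^{(\tau)}) = p(\eta^{(\sigma)})\, p(\zeta^{(\tau)})$ for any $\eta, \zeta \in <\Phi>$ such that ($\eta \le \phi$ and $\zeta < \psi$) or ($\eta < \phi$ and $\zeta \le \psi$). From $[\phi|\sigma] \equiv \bigvee_{\eta \le \phi} \eta^{(\sigma)}$ and $[\psi|\tau] \equiv \bigvee_{\zeta \le \psi} \zeta^{(\tau)}$ it is deduced:

$$[\phi|\sigma] \wedge [\psi|\tau] \equiv \Big(\bigvee_{\eta \le \phi} \eta^{(\sigma)}\Big) \wedge \Big(\bigvee_{\zeta \le \psi} \zeta^{(\tau)}\Big) \equiv \bigvee_{\eta \le \phi}\ \bigvee_{\zeta \le \psi} \big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big).$$

It follows:

$$p([\phi|\sigma] \wedge [\psi|\tau]) = \sum_{\eta \le \phi} \sum_{\zeta \le \psi} p\big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big),$$

$$p[\phi|\sigma] = \sum_{\eta \le \phi} p\big(\eta^{(\sigma)}\big),$$

$$p[\psi|\tau] = \sum_{\zeta \le \psi} p\big(\zeta^{(\tau)}\big).$$

Now, $[\phi|\sigma]$ and $[\psi|\tau]$ are independent and $p([\phi|\sigma] \wedge [\psi|\tau]) = p[\phi|\sigma]\, p[\psi|\tau]$. Then:

$$\sum_{\eta \le \phi} \sum_{\zeta \le \psi} p\big(\eta^{(\sigma)} \wedge \zeta^{(\tau)}\big) = \sum_{\eta \le \phi} \sum_{\zeta \le \psi} p\big(\eta^{(\sigma)}\big)\, p\big(\zeta^{(\tau)}\big).$$

From the recursion assumption, it is deduced $p(\phi^{(\sigma)} \wedge \psi^{(\tau)}) = p(\phi^{(\sigma)})\, p(\psi^{(\tau)})$.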


























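From the factorization (8.15), it is deduced:

$$p\big(\phi^{(\sigma \wedge \tau)}\big) = \sum_{\eta \wedge \zeta = \phi} p\big(\eta^{(\sigma)}\big)\, p\big(\zeta^{(\tau)}\big). \qquad (8.18)$$

This result relies strongly on the independence hypothesis about the sensors.

Back to $[\phi|\sigma \wedge \tau]$. Reminding that $[\phi|\sigma \wedge \tau] \equiv \bigvee_{\psi \le \phi} \psi^{(\sigma \wedge \tau)}$, the fused probability $p[\phi|\sigma \wedge \tau]$ is deduced from $p(\psi^{(\sigma \wedge \tau)})$ by means of the relation:

$$p[\phi|\sigma \wedge \tau] = \sum_{\psi \le \phi} p\big(\psi^{(\sigma \wedge \tau)}\big). \qquad (8.19)$$

Conclusion. It is possible to derive an exact fused information $p[\phi|\sigma \wedge \tau]$, $\phi \in <\Phi>$, from the information $p[\phi|\sigma]$, $\phi \in <\Phi>$, and $p[\phi|\tau]$, $\phi \in <\Phi>$, obtained from two independent sensors $\sigma$ and $\tau$.

This derivation is done in 3 steps:

• Compute $p(\phi^{(\sigma)})$ and $p(\phi^{(\tau)})$ by means of (8.16),
• Compute $p(\phi^{(\sigma \wedge \tau)})$ by means of (8.18),
• Compute $p[\phi|\sigma \wedge \tau]$ by means of (8.19).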
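
The three steps can be sketched for a two-atom universe. This is an illustrative sketch under stated assumptions, not the author's implementation: propositions of the hyper-power set are encoded as sets of Venn regions (so that $\le$ is set inclusion and $\wedge$ is intersection), the function names are hypothetical, and the input probabilities are example values in the spirit of the thief example.

```python
from itertools import product

# Hyper-power set of two atoms a, b, encoded by Venn regions "a", "b", "ab".
a = frozenset({"a", "ab"})
b = frozenset({"b", "ab"})
meet, join = a & b, a | b
elements = [meet, a, b, join]


def exclusive_probabilities(p_sure):
    """Step 1, equation (8.16): p(phi^(s)) = p[phi|s] - sum of p(psi^(s)) over psi < phi."""
    out = {}
    for phi in sorted(p_sure, key=len):  # smaller propositions are processed first
        out[phi] = p_sure[phi] - sum(v for psi, v in out.items() if psi < phi)
    return out


def fuse(p_sigma, p_tau):
    """Step 2, equation (8.18): sum of products over the pairs with eta AND zeta = phi."""
    out = {phi: 0.0 for phi in elements}
    for (eta, x), (zeta, y) in product(p_sigma.items(), p_tau.items()):
        out[eta & zeta] += x * y
    return out


def sure_probabilities(p_exclusive):
    """Step 3, equation (8.19): p[phi|s] = sum of p(psi^(s)) over psi <= phi."""
    return {phi: sum(v for psi, v in p_exclusive.items() if psi <= phi)
            for phi in elements}


# Example inputs p[phi|sigma] and p[phi|tau] (illustrative values).
p_sigma = {meet: 0.0, a: 0.99, b: 0.0, join: 1.0}
p_tau = {meet: 0.0, a: 0.0, b: 0.99, join: 1.0}

fused_exclusive = fuse(exclusive_probabilities(p_sigma), exclusive_probabilities(p_tau))
fused_sure = sure_probabilities(fused_exclusive)
print(round(fused_exclusive[meet], 4), round(fused_sure[a], 4), round(fused_sure[join], 4))
```

Sorting by the number of regions is enough to guarantee that every $\psi < \phi$ is handled before $\phi$, because a strictly smaller proposition covers strictly fewer regions.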


8.6.4.4 Link with the DSmT 


It is noteworthy that the relation (8.18) looks strangely like the DSmT fusion rule (8.4), although these two results have been obtained from quite different viewpoints. In fact the similarity is not just related to the fusion rule and the whole construction is identical. More precisely, let us now consider the problem from the DSmT viewpoint.

Let be defined for two sensors $\sigma$ and $\tau$ the respective bba $m_\sigma$ and $m_\tau$ over $<\Phi>$. The belief functions associated to these two bba, denoted respectively $\mathrm{Bel}_\sigma$ and $\mathrm{Bel}_\tau$, are just verifying:

$$\mathrm{Bel}_\sigma(\phi) = \sum_{\psi \le \phi} m_\sigma(\psi) \quad\text{and}\quad \mathrm{Bel}_\tau(\phi) = \sum_{\psi \le \phi} m_\tau(\psi).$$

Conversely, the bba $m_\sigma$ is recovered by means of the recursion:

$$\forall \phi \in <\Phi>, \quad m_\sigma(\phi) = \mathrm{Bel}_\sigma(\phi) - \sum_{\psi < \phi} m_\sigma(\psi).$$

The fused bba $m_\sigma \oplus m_\tau$ is defined by:

$$m_\sigma \oplus m_\tau(\phi) = \sum_{\psi \wedge \eta = \phi} m_\sigma(\psi)\, m_\tau(\eta).$$

Now make the hypothesis that the probabilities $p[\phi|\sigma]$ and $p[\phi|\tau]$ are initialized for any $\phi \in <\Phi>$ by:

$$p[\phi|\sigma] = \mathrm{Bel}_\sigma(\phi) \quad\text{and}\quad p[\phi|\tau] = \mathrm{Bel}_\tau(\phi).$$

Then, the following results are obviously obtained:

• $p(\phi^{(\sigma)}) = m_\sigma(\phi)$ and $p(\phi^{(\tau)}) = m_\tau(\phi)$,
• $p(\phi^{(\sigma \wedge \tau)}) = m_\sigma \oplus m_\tau(\phi)$,
• $p[\phi|\sigma \wedge \tau] = \mathrm{Bel}_\sigma \oplus \mathrm{Bel}_\tau(\phi)$, where $\mathrm{Bel}_\sigma \oplus \mathrm{Bel}_\tau$ is the belief function associated to $m_\sigma \oplus m_\tau$.

From this discussion, it seems natural to consider the probabilized multi-modal logic $mM$ as a possible logical interpretation of DSmT.

Evaluate the consequence of the independence axiom. By using the axiom m.indep., it is possible to prove (8.13). Otherwise, it is only possible to prove (8.12), which means that possibly more belief is put on the smallest propositions, in comparison with the independent sensors case. Such a property expresses a better and more precise knowledge about the world. Then it appears, according to the $mM$ interpretation of DSmT, that the fusion rule $\oplus$ is an optimal rule only for fusing independent and (strictly) reliable sensors.


8.7 Logical interpretation of the Bayes inference 


Notation. In the sequel, $\phi \leftrightarrow \psi$ is just an equivalent notation for $(\psi \to \phi) \wedge (\phi \to \psi)$.


General discussion. The Bayes inference explains the probability of a proposition $\psi$, when a proposition $\phi$ is known. This probability is expressed as follows by the quantity $p(\psi|\phi)$:

$$p(\phi \wedge \psi) = p(\phi)\, p(\psi|\phi).$$

From this implicit and probabilistic definition, $(\psi|\phi)$ appears more like a mathematical artifice than an actual "logical" operator. However, $(\psi|\phi)$ has clearly a meta-logical meaning although it is intuitive and just implied: it characterizes the knowledge about $\psi$, when a prior information $\phi$ is known. In this section, we are trying to interpret the Bayes operator $(\,\cdot\,|\,\cdot\,)$ as a logical operator. The author admits that this viewpoint seems extremely suspicious: the Bayes inference implies a change of the probabilistic universe, and then a change of the truth values! It makes no sense to put at the same level a conditional probability with an unconditional probability! But in fact, there are logics which handle multiple truths: the modal logics, and more precisely, the multi-modal logics. However, the model we are defining here is quite different from the usual modal models.

From now on, we are assuming a single logic involving all the operators, i.e. $\wedge$, $\neg$, $\vee$, $\to$ and $(\,\cdot\,|\,\cdot\,)$, and a single probability function $p$ defined over the resulting propositions.


When defining a logic, a first step is perhaps to enumerate the intuitive properties the new logic should have, and then derive new language and rules. Since a probability is based on a Boolean algebra, this logic will include the classical logic. A first question arises then: is the Bayes inference $(\,\cdot\,|\,\cdot\,)$ the same inference as in classical logic? More precisely, do we have $(\psi|\phi) \equiv \phi \to \psi$? If our logical model is coherent with the probability, this should imply:

$$p(\psi|\phi) = p(\phi \to \psi) = p(\neg\phi \vee \psi).$$

Applying the Bayes rule, it is deduced:

$$p(\phi \wedge \psi) = p(\phi)\, p(\neg\phi \vee \psi) = \big(p(\phi \wedge \psi) + p(\phi \wedge \neg\psi)\big)\big(1 - p(\phi \wedge \neg\psi)\big).$$




This is clearly false: e.g. taking $p(\phi \wedge \psi) = \frac{1}{4}$ and $p(\phi \wedge \neg\psi) = \frac{1}{2}$ results in $\frac{1}{4} = \frac{3}{8}$! The Bayes inference $(\psi|\phi)$ is not a classical inference. Since it is a new kind of inference, we have to explain the meaning of this inference.
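
A short numerical check makes the incompatibility explicit. The sketch below is only an illustration: the function name `bayes_vs_implication` is hypothetical, and the probabilities 1/4 and 1/2 are an illustrative choice; any values with $0 < p(\phi \wedge \neg\psi)$ and $p(\phi) < 1$ lead to the same kind of contradiction.

```python
from fractions import Fraction


def bayes_vs_implication(p_phi_and_psi, p_phi_and_not_psi):
    """Compare p(phi AND psi) with p(phi) * p(NOT phi OR psi).

    The two quantities would have to be equal if (psi|phi) were the classical
    inference phi -> psi.
    """
    p_phi = p_phi_and_psi + p_phi_and_not_psi
    p_implication = 1 - p_phi_and_not_psi  # p(NOT phi OR psi) = 1 - p(phi AND NOT psi)
    return p_phi_and_psi, p_phi * p_implication


lhs, rhs = bayes_vs_implication(Fraction(1, 4), Fraction(1, 2))
print(lhs, rhs, lhs == rhs)
```

The two sides only agree in degenerate cases, for instance when $p(\phi \wedge \neg\psi) = 0$, which is why $(\psi|\phi)$ has to be treated as a new kind of inference.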


The Bayes inference seems to rely on the following principles:

• Any proposition $\phi$ induces a sub-universe, entirely characterized by the Bayes operator $(\,\cdot\,|\phi)$. For this reason, $(\,\cdot\,|\phi)$ could be seen as a conditional modality. But this modality possesses a strange quality: the implied sub-universe is essentially classical. From now on, $(\,\cdot\,|\phi)$ refers both to the modality and its induced sub-universe,

• The sub-universe $(\,\cdot\,|\top)$ is just the whole universe. The empty universe $(\,\cdot\,|\bot)$ is a singularity which cannot be manipulated,

• The sub-universe $(\,\cdot\,|\phi)$ is a projection of the sup-universe (which could be another sub-universe) into $\phi$. In particular, the axioms of $(\,\cdot\,|\phi)$ result from the propositions which are axioms within the range $\phi$ in the sup-universe. Moreover, the modus ponens should work in the sub-universes,

• Any sub-proposition $(\psi|\phi)$ implies the inferred proposition $\phi \to \psi$ in the sup-universe. This last point is not exactly the converse of the previous point. The previous point concerns axioms, while any possible propositions are considered here. This (modal-like) difference is necessary and makes the distinction between $(\,\cdot\,|\,\cdot\,)$ and $\to$,

• Since sub-universes are classical, the negation has a classical behavior: the double negation vanishes,

• The sub-universe of a sub-universe is the intersected sub-universe. For example, "considering blue animals within a universe of birds" means "considering blue birds".

In association with the Bayes inference is the notion of independence between propositions, described by the meta-operator $\times$, which is not an operator of the logic. More precisely, $\psi$ is independent to $\phi$, i.e. $\psi \times \phi$, when it is equivalent to consider $\psi$ within the sub-universe $\phi$ or within the sup-universe. Deciding whether this meta-operator is symmetric or not is probably another philosophical issue. In the sequel, this hypothesis is made possible in the axiomatization but is not required. Moreover, it seems reasonable that complementary propositions like $\phi$ and $\neg\phi$ cannot be independent unless $\phi \equiv \top$. In the following discussion, such a rule is proposed but not required.


8.7.1 Definitions 
8.7.1.1 Bayesian modal language 
The set of the Bayesian propositions bM is constructed recursively:

• $\mathcal{C}\subset bM$,

• If $\phi,\psi\in bM$, then $(\psi|\phi)\in bM$,

• If $\phi,\psi\in bM$, then $\neg\phi\in bM$, $\phi\wedge\psi\in bM$, $\phi\vee\psi\in bM$ and $\phi\to\psi\in bM$.


8.7.1.2 Bayesian logical rules 

The logic over bM obeys the following rules and axioms:

• Classical axioms and modus ponens,

b.i. $(\phi|\top)\equiv\phi$; the sub-universe of $\top$ is of course the whole universe,

b.ii. It is assumed $\nvdash\neg\phi$. Then, $\vdash\phi\to\psi$ implies $\vdash(\psi|\phi)$; axioms within the range $\phi$ are axioms of the sub-universe $(\cdot|\phi)$,

b.iii. It is assumed $\nvdash\neg\phi$. Then, $\vdash(\psi\to\eta|\phi)\to\bigl((\psi|\phi)\to(\eta|\phi)\bigr)$; when both an inference and a premise are recognized true in a sub-universe, the conclusion also holds true in this sub-universe. This property allows the modus ponens within sub-universes,

b.iv. It is assumed $\nvdash\neg\phi$. Then, $\vdash(\psi|\phi)\to(\phi\to\psi)$; the modality $(\cdot|\phi)$ implies the truth within the range $\phi$,

b.v. It is assumed $\nvdash\neg\phi$. Then, $\neg(\neg\psi|\phi)\equiv(\psi|\phi)$; there is no doubt within the modality $(\cdot|\phi)$. Sub-universes have a classical negation operator. However, truth may change depending on the proposition of reference $\phi$,

b.vi. It is assumed $\nvdash\neg(\phi\wedge\psi)$ ³. Then, $((\eta|\psi)|\phi)\equiv(\eta|\phi\wedge\psi)$; the sub-universe $(\cdot|\psi)$ of a sub-universe $(\cdot|\phi)$ is the intersected sub-universe $(\cdot|\phi\wedge\psi)$,

b.vii. $\psi\times\phi$ means $\vdash(\psi|\phi)\leftrightarrow\psi$; a proposition $\psi$ is independent of a proposition $\phi$ when it makes no difference to observe it in the sub-universe $(\cdot|\phi)$ or not,

b.viii. (optional) $\psi\times\phi$ implies $\phi\times\psi$; the independence relation is symmetric,

b.ix. (optional) Assuming $\phi\times\psi$ and $\vdash\phi\vee\psi$ implies $\vdash\phi$ or $\vdash\psi$; this uncommon logical rule explains that complementary and non trivial propositions cannot be independent. E.g. to an extreme degree, $\phi$ and $\neg\phi$ are strictly complementary and at the same time are not independent unless $\phi\equiv\top$ or $\phi\equiv\bot$.


These axioms leave the modality $(\cdot|\bot)$ undefined, by requiring the condition $\nvdash\neg\phi$ for any deduction on the sub-universe $(\cdot|\phi)$. In fact, the modality $(\cdot|\bot)$ is a singularity which cannot be defined according to the common axioms and rules. Otherwise, it would be deduced from $\vdash\bot\to\phi$ that $\vdash(\phi|\bot)$; this last deduction working for any $\phi$ would contradict the negation rule $\neg(\neg\phi|\bot)\equiv(\phi|\bot)$. Nevertheless, the axioms b.vii. and b.viii. induce a definition of $\times$ for any pair of propositions, except $(\bot,\bot)$.


³ It will be proved that the hypothesis $\nvdash\neg(\phi\wedge\psi)$ implies the hypotheses $\nvdash\neg\phi$ and $\nvdash(\neg\psi|\phi)$.
8.7.2 Properties
8.7.2.1 Probability over bM


A probability p over bM is defined according to the definition of section 8.4. In particular, since the meta-operator $\times$ characterizes an independence between propositions, it is naturally hypothesized that:


$\phi\times\psi$ implies $p(\phi\wedge\psi)=p(\phi)\,p(\psi)$.


8.7.2.2 Useful theorems 
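Sub-universes are classical. It is assumed $\nvdash\neg\phi$. Then:
• $(\neg\psi|\phi)\equiv\neg(\psi|\phi)$,
• $(\psi\wedge\eta|\phi)\equiv(\psi|\phi)\wedge(\eta|\phi)$,
• $(\psi\vee\eta|\phi)\equiv(\psi|\phi)\vee(\eta|\phi)$,
• $(\psi\to\eta|\phi)\equiv(\psi|\phi)\to(\eta|\phi)$.


Proof. The first theorem is a consequence of axiom b.v.


From axiom b.iii., it is deduced $\vdash(\neg\psi\vee\eta|\phi)\to\bigl(\neg(\psi|\phi)\vee(\eta|\phi)\bigr)$. Applying the first theorem, it is deduced $\vdash(\neg\psi\vee\eta|\phi)\to\bigl((\neg\psi|\phi)\vee(\eta|\phi)\bigr)$. At last:


$$\vdash(\psi\vee\eta|\phi)\to\bigl((\psi|\phi)\vee(\eta|\phi)\bigr). \tag{8.20}$$
It is deduced $\vdash\neg\bigl((\psi|\phi)\vee(\eta|\phi)\bigr)\to\neg(\psi\vee\eta|\phi)$ and, by applying the first theorem,
$$\vdash\bigl((\neg\psi|\phi)\wedge(\neg\eta|\phi)\bigr)\to(\neg\psi\wedge\neg\eta|\phi).$$
At last:
$$\vdash\bigl((\psi|\phi)\wedge(\eta|\phi)\bigr)\to(\psi\wedge\eta|\phi).$$
Now, it is deduced from $\vdash\phi\to\bigl((\psi\wedge\eta)\to\psi\bigr)$ that:
$$\vdash\bigl((\psi\wedge\eta)\to\psi\,\big|\,\phi\bigr).$$
By applying the axiom b.iii.:
$$\vdash(\psi\wedge\eta|\phi)\to(\psi|\phi).$$
It is similarly proved that $\vdash(\psi\wedge\eta|\phi)\to(\eta|\phi)$ and finally:
$$\vdash(\psi\wedge\eta|\phi)\to\bigl((\psi|\phi)\wedge(\eta|\phi)\bigr).$$
The second theorem is then proved.
The third theorem is a consequence of the first and second theorems.


The last theorem is a consequence of the first and third theorems.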
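Inference property. It is assumed $\nvdash\neg\phi$. Then $(\psi|\phi)\wedge\phi\equiv\phi\wedge\psi$. In particular, the hypothesis $\nvdash\neg(\phi\wedge\psi)$ implies the hypotheses $\nvdash\neg\phi$ and $\nvdash(\neg\psi|\phi)$.


Proof. From b.iv. it comes $\vdash(\neg\psi|\phi)\to(\phi\to\neg\psi)$. Then $\vdash\neg(\phi\to\neg\psi)\to\neg(\neg\psi|\phi)$ and $\vdash(\phi\wedge\psi)\to\neg(\neg\psi|\phi)$. It follows $\vdash(\phi\wedge\psi)\to(\psi|\phi)$ and finally:


$$\vdash(\phi\wedge\psi)\to\bigl((\psi|\phi)\wedge\phi\bigr).$$
The converse is simpler. From $\vdash(\psi|\phi)\to(\phi\to\psi)$, it follows:


$$\vdash\bigl((\psi|\phi)\wedge\phi\bigr)\to\bigl((\phi\to\psi)\wedge\phi\bigr).$$


Since $(\phi\to\psi)\wedge\phi\equiv\phi\wedge\psi$, the converse is proved.
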
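Intra-independence. It is assumed $\nvdash\neg\phi$. Then $(\eta|\phi)\times(\psi|\phi)$ is equivalently defined by the property $\vdash((\eta|\psi)|\phi)\leftrightarrow(\eta|\phi)$.


Proof.
$$((\eta|\psi)|\phi)\leftrightarrow(\eta|\phi)\;\equiv\;(\eta|\phi\wedge\psi)\leftrightarrow(\eta|\phi)\;\equiv\;(\eta|\phi\wedge(\psi|\phi))\leftrightarrow(\eta|\phi)\;\equiv\;\bigl((\eta|\phi)\big|(\psi|\phi)\bigr)\leftrightarrow(\eta|\phi).$$
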
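Independence invariant. $\psi\times\phi$ implies $\neg\psi\times\phi$.
Proof.
$$(\psi|\phi)\leftrightarrow\psi\;\equiv\;\neg(\psi|\phi)\leftrightarrow\neg\psi\;\equiv\;(\neg\psi|\phi)\leftrightarrow\neg\psi.$$
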
Inter-independence. It is assumed $\nvdash\neg\phi$. Then $(\psi|\phi)\times\phi$.


Proof. From axiom b.vi.:


$$((\psi|\phi)|\phi)\equiv(\psi|\phi\wedge\phi)\equiv(\psi|\phi).$$
It is deduced $(\psi|\phi)\times\phi$.

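Corollary: assuming the rules b.viii. and b.ix., the hypotheses $\nvdash\neg\phi$ and $\nvdash(\neg\psi|\phi)$ imply the hypothesis $\nvdash\neg(\phi\wedge\psi)$.

Proof. Assume $\vdash\neg(\phi\wedge\psi)$. Then $\vdash\neg(\phi\wedge(\psi|\phi))$ and $\vdash\neg\phi\vee\neg(\psi|\phi)$. Since $(\psi|\phi)\times\phi$, it follows $\phi\times(\psi|\phi)$ from rule b.viii. And then $\neg\phi\times\neg(\psi|\phi)$. Now, applying the rule b.ix. to $\vdash\neg\phi\vee\neg(\psi|\phi)$, it is deduced $\vdash\neg\phi$ or $\vdash\neg(\psi|\phi)$.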
A proposition is true in its proper sub-universe. It is assumed $\nvdash\neg\phi$. Then $\vdash(\phi|\phi)$.


Proof. Obvious from $\vdash\phi\to\phi$.

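Narcissist independence. It is assumed $\nvdash\neg\phi$. Then, $\phi\times\phi$ implies $\vdash\phi$ and conversely. In particular, $\phi\times\phi$ implies $\phi\equiv\top$.


Proof.
$$(\phi|\phi)\leftrightarrow\phi\;\equiv\;\top\leftrightarrow\phi\;\equiv\;\phi.$$
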
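Non transitivity (modus barbara fails). It is assumed $\nvdash\neg\phi$ and $\nvdash\neg\psi$. Then


$$\nvdash(\psi|\phi)\to\bigl((\eta|\psi)\to(\eta|\phi)\bigr).$$


Proof. The choice $\psi=\top$, $\eta=\neg\phi$ and $\phi\not\equiv\top$ is a counter example:


$$(\top|\phi)\to\bigl((\neg\phi|\top)\to(\neg\phi|\phi)\bigr)\;\equiv\;\top\to(\neg\phi\to\bot)\;\equiv\;\phi.$$
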
8.7.2.3 Axioms and rules extend to sub-universes 

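Assume $\nvdash\neg\phi$. The rules and axioms of bM extend on the sub-universe $(\cdot|\phi)$:
• $\vdash\psi$ implies $\vdash(\psi|\phi)$,
• It is assumed $\nvdash\neg(\phi\wedge\psi)$. Then $\vdash(\psi\to\eta|\phi)$ implies $\vdash((\eta|\psi)|\phi)$,
• It is assumed $\nvdash\neg(\phi\wedge\psi)$. Then $\vdash\bigl((\eta\to\zeta|\psi)\big|\phi\bigr)\to\bigl((\eta|\psi)\to(\zeta|\psi)\big|\phi\bigr)$,
• It is assumed $\nvdash\neg(\phi\wedge\psi)$. Then $\vdash((\eta|\psi)|\phi)\to(\psi\to\eta|\phi)$.


Proof. $\vdash\psi$ implies $\vdash\phi\to\psi$ and then $\vdash(\psi|\phi)$. The first point is then proved.


It is successively implied from $\vdash(\psi\to\eta|\phi)$:

$\vdash(\psi|\phi)\to(\eta|\phi)$,
$\vdash\bigl((\eta|\phi)\big|(\psi|\phi)\bigr)$,
$\vdash(\eta|\phi\wedge(\psi|\phi))$,
$\vdash(\eta|\phi\wedge\psi)$,
$\vdash((\eta|\psi)|\phi)$.

The second point is then proved.
By applying axiom b.iii. and the first point, it comes:
$$\vdash\Bigl((\eta\to\zeta|\psi)\to\bigl((\eta|\psi)\to(\zeta|\psi)\bigr)\,\Big|\,\phi\Bigr).$$


It follows:
$$\vdash\bigl((\eta\to\zeta|\psi)\big|\phi\bigr)\to\bigl((\eta|\psi)\to(\zeta|\psi)\big|\phi\bigr).$$
The third point is proved.


By applying axiom b.iv. and the first point, it comes:


$$\vdash\bigl((\eta|\psi)\to(\psi\to\eta)\,\big|\,\phi\bigr).$$


It follows:
$$\vdash((\eta|\psi)|\phi)\to(\psi\to\eta|\phi).$$


The fourth point is proved.
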
8.7.2.4 Bayes inference 


It is assumed $\nvdash\neg\phi$. Define $p(\psi|\phi)$ as an abbreviation for $p((\psi|\phi))$. Then:


$$p(\psi|\phi)\,p(\phi)=p(\phi\wedge\psi).$$


Proof. This result is implied by the theorems $(\psi|\phi)\wedge\phi\equiv\phi\wedge\psi$ and $(\psi|\phi)\times\phi$.
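
The short proof above can be unpacked into the following chain. It is only a sketch: it uses nothing beyond the inference property, the inter-independence theorem and the independence hypothesis of section 8.7.2.1.

```latex
\begin{align*}
p(\phi\wedge\psi)
  &= p\bigl((\psi|\phi)\wedge\phi\bigr)
  && \text{since } (\psi|\phi)\wedge\phi\equiv\phi\wedge\psi \\
  &= p\bigl((\psi|\phi)\bigr)\,p(\phi)
  && \text{since } (\psi|\phi)\times\phi \text{ and independence implies a product} \\
  &= p(\psi|\phi)\,p(\phi)
  && \text{by the abbreviation } p(\psi|\phi)=p\bigl((\psi|\phi)\bigr)
\end{align*}
```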

8.7.2.5 Conclusion 


Finally, the Bayes inference has been recovered from our axiomatization of the operator $(\cdot|\cdot)$. Although this result needs more investigation, in particular for the justification of the coherence of bM, it appears that the Bayesian inference could be interpreted logically as a manner of handling knowledge. A similar result has been obtained for the fusion rule of DSmT. At last, it seems possible to conjecture that logics and probability could be mixed in order to derive many other belief rules or inferences.


8.8 Conclusion 


In this contribution, it has been shown that DSmT was interpretable in the paradigm of probabilized multi-modal logic. This logical characterization has made apparent the true necessity of an independence hypothesis about the sensors, when applying the $\oplus$ fusion rule. Moreover, it is expected that our work has given some clarifications about the semantics associated with the conjunctive rule of DSmT.

A similar logical interpretation of the Bayes inference has been constructed, although this preliminary work should be improved. At last, it seems possible to handle probabilized logics as a relatively general framework for manipulating non deterministic information. This is perhaps a generic method for constructing new customized belief theories. The principle is first to construct a logic well adapted to the problem, second to probabilize this logic, and third to derive the implied new belief theory (and forget then the mother logic!):


Classical Logic --(probabilized propositions)--> Probability
New Logic --(probabilized propositions)--> New Belief Theory


It seems obvious that there could be many theories and rules for manipulating non deterministic information. This is not a new result and I feel it necessary to refer to the works of Sombo, Lefèvre, De Brucq et al. [6] [7], which have investigated such questions.


At last, a common framework for both DSmT and Bayesian inference could certainly be derived by fusing the logics mM and bM.
Chapter 9


On conjunctive and disjunctive combination rules of evidence


Hongyan Sun and Mohamad Farooq 
Department of Electrical & Computer Engineering 
Royal Military College of Canada 
Kingston, ON, Canada, K7K 7B4 


Abstract: In this chapter, the Dempster-Shafer (DS) combination rule is examined 
based on the multi-valued mapping (MVM) and the product combination rule of mul- 
tiple independent sources of information. The shortcomings in DS rule are correctly 
interpreted via the product combination rule of MVM. Based on these results, a new 
justification of the disjunctive rule is proposed. This combination rule depends on 
the logical judgment of OR and overcomes the shortcomings of DS rule, especially, in 
the case of the counter-intuitive situation. The conjunctive, disjunctive and hybrid 
combination rules of evidence are studied and compared. The properties of each rule 
are also discussed in detail. The role of evidence of each source of information, the
comparison of the combination judgment belief and ignorance of each rule, the treat- 
ment of conflicting judgments given by sources, and the applications of combination 
rules are discussed. The new results yield valuable theoretical insight into the rules 
that can be applied to a given situation. Zadeh’s example is also included in this 
chapter for the evaluation of the performance and the efficiency of each combination 


rule of evidence in case of conflicting judgments. 
9.1 Introduction


Combination theory of multiple sources of information is always an important area of research in the processing of multiple sources. The initial important contribution in this area is due to Dempster in terms of Dempster’s rule [1]. Dempster derived the combination rule for multiple independent sources of information based on the product space of multiple sources of information and multi-valued mappings. In the product space, combination-mapping of multiple multi-valued mappings is defined as the intersection of each multi-valued mapping, that is, an element can be judged by combination sources of information if and only if it can be judged by each source of information simultaneously, irrespective of the magnitude of the basic judgment probability. Shafer extended Dempster’s theory to the space with all the subsets of a given set (i.e. the power set) and defined the frame of discernment, degree of belief, and, furthermore, proposed a new combination rule of the multiple independent sources of information in the form of Dempster-Shafer’s (DS) combination rule [2]. However, the interpretation, implementation, or computation of the technique are not described in sufficient detail in [2]. Due to the lack of details in [2], the literature is full of techniques to arrive at DS combination rule. For example, compatibility relations [3] [4], random subsets [5] [6] [7], inner probability [8] [9], joint (conjunction) entropy etc. have been utilized to arrive at the results in [2]. In addition, the technique has been applied in various fields such as engineering, medicine, statistics, psychology, philosophy and accounting [11], and multi-sensor information fusion [12] [13] [14] [15] [16] etc. DS combination rule is more efficient and effective than the Bayesian judgment rule because the former does not require a priori probability and can process ignorance. A number of researchers have documented the drawbacks of DS techniques, such as the counter-intuitive results for some pieces of evidence [17] [18] [19], computational expenses and independent sources of information [20] [21].


One of the problems in DS combination rule of evidence is that the measure of the basic probability assignment of the combined empty set is not zero, i.e. $m(\emptyset)\neq 0$, whereas it is supposed to be zero, i.e. $m(\emptyset)=0$. In order to overcome this problem, the remaining measure of the basic probability assignment is reassigned via the orthogonal technique [2]. This has created a serious problem for the combination of two sharp sources of information, especially when the two sharp sources of information have only one of the same focal elements (i.e. the two sources of information are in conflict), thus resulting in a counter-intuitive situation as demonstrated by Zadeh. In addition, DS combination rule cannot be applied to two sharp sources of information that have none of the same focal elements. These problems are not essentially due to the orthogonal factor in DS combination rule (see references [22] [23]).


In general, there are two main techniques to resolve the Shafer problem. One is to suppose $m(\emptyset)\neq 0$ or $m(\emptyset)>0$ as it is in reality. The Smets transferable belief model (TBM), and the Yager, Dubois & Prade and Dezert-Smarandache (DSm) combination rules are the ones that utilize this fact in references [20] [24] [25] [26] [27] [28]. The other technique is that the empty set in the combined focal elements is not allowed, and this idea is employed in the disjunctive combination rule [22] [23] [31]. Moreover, E. Lefèvre et al. propose a general combination formula of evidence in [32], and further conjunctive combination rules of evidence can be derived from it.


In this chapter, we present some of the work that we have done on the combination rules of evidence. Based on a multi-valued mapping from a probability space $(X,\Omega,\mu)$ to a space $S$, a probability measure over a class $2^S$ of subsets of $S$ is defined. Then, using the product combination rule of multiple information sources, Dempster-Shafer’s combination rule is derived. The investigation of the two rules indicates that Dempster’s rule and DS combination rule are for different spaces. Some problems of DS combination rule are correctly interpreted via the product combination rule that is used for multiple independent information sources. An error in multi-valued mappings in [1] is pointed out and proven.


Furthermore, a novel justification of the disjunctive combination rule for multiple independent sources 
of information based on the redefined combination-mapping rule of multiple multi-valued mappings in 
the product space of multiple independent sources of information is being proposed. The combination 
rule reveals a type of logical inference in the human judgment, that is, the OR rule. It overcomes the 
shortcoming of DS combination rule with the AND rule, especially, the one that is counter-intuitive, and 
provides a more plausible judgment than DS combination rule over different elements that are judged by 


different sources of information. 


Finally, the conjunctive and disjunctive combination rules of evidence, namely, DS combination rule, 
Yager’s combination rule, Dubois and Prade’s (DP) combination rule, DSm’s combination rule and the 
disjunctive combination rule, are studied for the two independent sources of information. The properties 
of each combination rule of evidence are discussed in detail, such as the role of evidence of each source 
of information in the combination judgment, the comparison of the combination judgment belief and 
ignorance of each combination rule, the treatment of conflict judgments given by the two sources of 
information, and the applications of combination rules. The new results yield valuable theoretical insight 
into the rules that can be applied to a given situation. Zadeh’s example is included in the chapter 
to evaluate the performance as well as efficiency of each combination rule of evidence for the conflict 


judgments given by the two sources of information. 
9.2 Preliminary


9.2.1 Source of information and multi-valued mappings 


Consider $n$ sources of information and corresponding multi-valued mappings [1]. They are mathematically defined by $n$ basic probability spaces $(X_i,\Omega_i,\mu_i)$ and multi-valued mappings $\Gamma_i$ which assign a subset $\Gamma_i x_i\subset S$ to every $x_i\in X_i$, $i=1,2,\ldots,n$. The space $S$ into which $\Gamma_i$ maps is the same for each $i$, namely: $n$ different sources yield information about the same uncertain outcomes in $S$.


Let the $n$ sources be independent. Then, based on the definition of statistical independence, the combined sources $(X,\Omega,\mu)$ can be defined as


$$X=X_1\times X_2\times\ldots\times X_n \tag{9.1}$$
$$\Omega=\Omega_1\times\Omega_2\times\ldots\times\Omega_n \tag{9.2}$$
$$\mu=\mu_1\times\mu_2\times\ldots\times\mu_n \tag{9.3}$$
for all $x\in X$, $\Gamma$ is defined as
$$\Gamma x=\Gamma_1 x\cap\Gamma_2 x\cap\ldots\cap\Gamma_n x \tag{9.4}$$


The definition of $\Gamma$ implies that $x_i\in X_i$ is consistent with a particular $s\in S$ if and only if $s\in\Gamma_i x_i$, for $i=1,2,\ldots,n$, and consequently $x=(x_1,x_2,\ldots,x_n)\in X$ is consistent with $s$ if and only if $s\in\Gamma_i x_i$ for all $i=1,2,\ldots,n$ [1].


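For finite $S=\{s_1,s_2,\ldots,s_n\}$, suppose $S_{\delta_1\delta_2\ldots\delta_n}$ denotes the subset of $S$ which contains $s_j$ if $\delta_j=1$ and excludes $s_j$ if $\delta_j=0$, for $j=1,2,\ldots,n$. Then the $2^n$ subsets of $S$ so defined are possible for all $\Gamma_i x_i$ ($i=1,2,\ldots,n$), and partition $X_i$ into


$$X_i=\bigcup_{\delta_1\delta_2\ldots\delta_n}X^{(i)}_{\delta_1\delta_2\ldots\delta_n} \tag{9.5}$$
where
$$X^{(i)}_{\delta_1\delta_2\ldots\delta_n}=\{x_i\in X_i,\ \Gamma_i x_i=S_{\delta_1\delta_2\ldots\delta_n}\} \tag{9.6}$$
and define [1]


$$p^{(i)}_{\delta_1\delta_2\ldots\delta_n}=\mu_i\bigl(X^{(i)}_{\delta_1\delta_2\ldots\delta_n}\bigr) \tag{9.7}$$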
9.2.2 Dempster’s combination rule of independent information sources


Based on equations (9.1)-(9.7), the combination of probability judgments of multiple independent information sources is characterized by [1] $p^{(i)}_{\delta_1\delta_2\ldots\delta_n}$, $i=1,2,\ldots,n$. That is

$$p_{\delta_1\delta_2\ldots\delta_n}=\sum_{\delta_i=\delta_i^{(1)}\delta_i^{(2)}\ldots\delta_i^{(n)}}p^{(1)}_{\delta_1^{(1)}\delta_2^{(1)}\ldots\delta_n^{(1)}}\,p^{(2)}_{\delta_1^{(2)}\delta_2^{(2)}\ldots\delta_n^{(2)}}\cdots p^{(n)}_{\delta_1^{(n)}\delta_2^{(n)}\ldots\delta_n^{(n)}} \tag{9.8}$$

Equation (9.8) indicates that the combination probability judgment of $n$ independent information sources for any element $S_{\delta_1\delta_2\ldots\delta_n}$ of $S$ equals the sum of the products of the simultaneous probability judgments of each independent information source for the element. It emphasizes the common role of each independent information source. That is characterized by the product combination rule.
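
As an illustration of equation (9.8), the following is a minimal sketch, assuming each source is given as a dictionary from subsets of $S$ (as `frozenset` objects) to probabilities. The function name `product_combination` and the toy numbers are hypothetical and not taken from the chapter. The product of the indices $\delta_i^{(1)}\delta_i^{(2)}\ldots$ corresponds to intersecting the subsets.

```python
from functools import reduce
from itertools import product


def product_combination(*sources):
    """Combine per-source probabilities over subsets of S as in eq. (9.8).

    Each source is a dict {frozenset: probability}. Every tuple of subsets
    (one per source) contributes the product of its probabilities to the
    intersection of those subsets, the empty set included.
    """
    combined = {}
    for choice in product(*(src.items() for src in sources)):
        weight = 1.0
        for _, prob in choice:
            weight *= prob
        target = reduce(frozenset.intersection, (subset for subset, _ in choice))
        combined[target] = combined.get(target, 0.0) + weight
    return combined


# Hypothetical toy data over S = {a, b}
p1 = {frozenset("a"): 0.6, frozenset("ab"): 0.4}
p2 = {frozenset("b"): 0.5, frozenset("ab"): 0.5}
p = product_combination(p1, p2)
```

Note that no normalization is applied here: the mass falling on the empty set is kept, which is what separates this product rule from the DS rule discussed in the chapter.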


9.2.3 Degree of belief 


Definition 1:


If $\Theta$ is a frame of discernment, then a function $m:2^\Theta\to[0,1]$ is called¹ a basic belief assignment whenever
$$m(\emptyset)=0 \tag{9.9}$$
and
$$\sum_{A\subseteq\Theta}m(A)=1 \tag{9.10}$$


The quantity $m(A)$ is called the belief mass of $A$ (or basic probability number in [2]).


Definition 2:
A function $\mathrm{Bel}:2^\Theta\to[0,1]$ is called a belief function over $\Theta$ [2] if it is given by
$$\mathrm{Bel}(A)=\sum_{B\subseteq A}m(B) \tag{9.11}$$


for some basic probability assignment $m:2^\Theta\to[0,1]$.


Definition 3:
A subset $A$ of a frame $\Theta$ is called a focal element of a belief function Bel over $\Theta$ [2] if $m(A)>0$. The union of all the focal elements of a belief function is called its core.


Theorem 1:
If $\Theta$ is a frame of discernment, then a function $\mathrm{Bel}:2^\Theta\to[0,1]$ is a belief function if and only if it satisfies the three following conditions [2]:

1.
$$\mathrm{Bel}(\emptyset)=0 \tag{9.12}$$
2.
$$\mathrm{Bel}(\Theta)=1 \tag{9.13}$$
3. For every positive integer $n$ and every collection $A_1,\ldots,A_n$ of subsets of $\Theta$,
$$\mathrm{Bel}(A_1\cup\ldots\cup A_n)\ge\sum_{I\subset\{1,\ldots,n\},\,I\neq\emptyset}(-1)^{|I|+1}\,\mathrm{Bel}\Bigl(\bigcap_{i\in I}A_i\Bigr) \tag{9.14}$$

¹ Also called basic probability assignment in [2].

Definition 4:
The function $\mathrm{Pl}:2^\Theta\to[0,1]$ defined by
$$\mathrm{Pl}(A)=1-\mathrm{Bel}(\bar{A}) \tag{9.15}$$


is called the plausibility function for Bel. $\bar{A}$ denotes the complement of $A$ in $\Theta$.
Definition 5:


If $\Theta$ is a frame of discernment, then a function $\mathrm{Bel}:2^\Theta\to[0,1]$ is called a Bayesian belief [2] if and only if
$$1.\ \mathrm{Bel}(\emptyset)=0 \tag{9.16}$$
$$2.\ \mathrm{Bel}(\Theta)=1 \tag{9.17}$$
$$3.\ \text{If } A,B\subseteq\Theta \text{ and } A\cap B=\emptyset, \text{ then } \mathrm{Bel}(A\cup B)=\mathrm{Bel}(A)+\mathrm{Bel}(B) \tag{9.18}$$


Theorem 2:
If $\mathrm{Bel}:2^\Theta\to[0,1]$ is a belief function over $\Theta$ and Pl is the plausibility corresponding to it, then the following statements are equivalent [2]:
1. The belief is a Bayesian belief.
2. Each focal element of Bel is a single element set.


3. $\forall A\subseteq\Theta$, $\mathrm{Bel}(A)+\mathrm{Bel}(\bar{A})=1$.
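
The definitions of belief and plausibility can be illustrated with a small sketch. It assumes a basic belief assignment stored as a dictionary from focal elements (`frozenset` objects) to masses. The helper names `bel` and `pl` and the numbers are hypothetical, chosen only to show equations (9.11) and (9.15) and the third statement of Theorem 2.

```python
def bel(m, A):
    """Belief of A: total mass of the focal elements contained in A (eq. 9.11)."""
    return sum(mass for B, mass in m.items() if B <= A)


def pl(m, A, frame):
    """Plausibility of A: one minus the belief of the complement of A (eq. 9.15)."""
    return 1.0 - bel(m, frame - A)


frame = frozenset({"s1", "s2", "s3"})
A = frozenset({"s1", "s2"})

# A non-Bayesian assignment: some mass sits on non-singleton subsets.
m = {frozenset({"s1"}): 0.5, frozenset({"s1", "s2"}): 0.3, frame: 0.2}
bel_A, pl_A = bel(m, A), pl(m, A, frame)

# A Bayesian assignment: every focal element is a singleton,
# so Bel(A) + Bel(complement of A) = 1 as in Theorem 2.
m_bayes = {frozenset({"s1"}): 0.7, frozenset({"s3"}): 0.3}
bayes_sum = bel(m_bayes, A) + bel(m_bayes, frame - A)
```

For the non-Bayesian assignment, belief and plausibility differ, and the gap between them reflects the mass left on the whole frame.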


9.2.4 The DS combination rule 


Theorem 3:

Suppose $\mathrm{Bel}_1$ and $\mathrm{Bel}_2$ are belief functions over the same frame of discernment $\Theta=\{\theta_1,\theta_2,\ldots,\theta_n\}$ with basic belief assignments $m_1$ and $m_2$, and focal elements $A_1,A_2,\ldots,A_k$ and $B_1,B_2,\ldots,B_l$, respectively. Suppose

$$\sum_{i,j:\,A_i\cap B_j=\emptyset}m_1(A_i)\,m_2(B_j)<1 \tag{9.19}$$

Then the function $m:2^\Theta\to[0,1]$ defined by $m(\emptyset)=0$ and

$$m(A)=\frac{\displaystyle\sum_{i,j:\,A_i\cap B_j=A}m_1(A_i)\,m_2(B_j)}{1-\displaystyle\sum_{i,j:\,A_i\cap B_j=\emptyset}m_1(A_i)\,m_2(B_j)} \tag{9.20}$$


for all non-empty $A\subseteq\Theta$ is a basic belief assignment [2]. The core of the belief function given by $m$ is equal to the intersection of the cores of $\mathrm{Bel}_1$ and $\mathrm{Bel}_2$. This defines Dempster-Shafer’s rule of combination (denoted as the DS combination rule in the sequel).
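
Equation (9.20) translates almost directly into code. The sketch below assumes basic belief assignments are dictionaries from focal elements (`frozenset` objects) to masses. The function name `ds_combine` and the toy assignments are hypothetical.

```python
def ds_combine(m1, m2):
    """Combine two basic belief assignments with the DS rule (eq. 9.20)."""
    raw = {}          # unnormalized mass of each non-empty intersection
    conflict = 0.0    # mass of the pairs with empty intersection, as in eq. (9.19)
    for A, a in m1.items():
        for B, b in m2.items():
            C = A & B
            if C:
                raw[C] = raw.get(C, 0.0) + a * b
            else:
                conflict += a * b
    if not raw:
        # No pair of focal elements intersects: condition (9.19) fails.
        raise ValueError("total conflict: the DS rule is undefined")
    return {C: v / (1.0 - conflict) for C, v in raw.items()}


# Hypothetical toy data over the frame {t1, t2}
frame = frozenset({"t1", "t2"})
m1 = {frozenset({"t1"}): 0.6, frame: 0.4}
m2 = {frozenset({"t2"}): 0.5, frame: 0.5}
m = ds_combine(m1, m2)
```

Here the conflicting mass is 0.3, so the three surviving products 0.3, 0.2 and 0.2 are each divided by 0.7.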


9.3 The DS combination rule induced by multi-valued mapping 


9.3.1 Definition of probability measure over the mapping space 


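Given a probability space $(X,\Omega,\mu)$ and a space $S$ with a multi-valued mapping:


$$\Gamma:X\to S \tag{9.21}$$


$$\forall x\in X,\ \Gamma x\subset S \tag{9.22}$$


The problem here is that if the uncertain outcome is known to correspond to an uncertain outcome $s\in\Gamma x$, then the probability judgement of the uncertain outcome $s\in\Gamma x$ needs to be determined.


Assume $S$ consists of $n$ elements, i.e. $S=\{s_1,s_2,\ldots,s_n\}$. Let us denote by $S_{\delta_1\delta_2\ldots\delta_n}$ the subsets of $S$, where $\delta_i=1$ or $0$, $i=1,2,\ldots,n$, and


$$S_{\delta_1\delta_2\ldots\delta_n}=\bigcup_{i\neq j,\ \delta_i=1,\ \delta_j=0}s_i \tag{9.23}$$


then from mapping (9.21)-(9.22) it is evident that $S_{\delta_1\delta_2\ldots\delta_n}$ is related to $\Gamma x$. Therefore, the $2^n$ subsets of $S$ such as in equation (9.23) yield a partition of $X$


$$X=\bigcup_{\delta_1\delta_2\ldots\delta_n}X_{\delta_1\delta_2\ldots\delta_n} \tag{9.24}$$
where
$$X_{\delta_1\delta_2\ldots\delta_n}=\{x\in X,\ \Gamma x=S_{\delta_1\delta_2\ldots\delta_n}\} \tag{9.25}$$


Define a probability measure over $2^S=\{S_{\delta_1\delta_2\ldots\delta_n}\}$ as $M:2^S=\{S_{\delta_1\delta_2\ldots\delta_n}\}\to[0,1]$ with


$$M(S_{\delta_1\delta_2\ldots\delta_n})=\begin{cases}0, & S_{\delta_1\delta_2\ldots\delta_n}=\emptyset\\[4pt] \dfrac{\mu(X_{\delta_1\delta_2\ldots\delta_n})}{1-\mu(X_{00\ldots0})}, & S_{\delta_1\delta_2\ldots\delta_n}\neq\emptyset\end{cases} \tag{9.26}$$

where $M$ is the probability measure over a class $2^S=\{S_{\delta_1\delta_2\ldots\delta_n}\}$ of subsets of the space $S$ which $\Gamma$ maps $X$ into.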
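
The construction of $M$ in equation (9.26) can be sketched for a finite probability space. The code assumes $\mu$ is given pointwise as a dictionary and $\Gamma$ as a dictionary from points to `frozenset` objects. The name `induced_measure` and the sample data are hypothetical.

```python
def induced_measure(mu, gamma):
    """Measure M over subsets of S induced by (X, mu) and the mapping gamma (eq. 9.26).

    mu maps each point x of X to its probability; gamma maps x to a frozenset of S.
    """
    mass = {}
    for x, p in mu.items():
        # Accumulate mu(X_delta) for the cell of the partition that x belongs to.
        mass[gamma[x]] = mass.get(gamma[x], 0.0) + p
    empty = mass.pop(frozenset(), 0.0)   # mu(X_00...0), mapped to mass zero
    if empty >= 1.0:
        raise ValueError("every point is mapped to the empty set")
    return {subset: p / (1.0 - empty) for subset, p in mass.items()}


# Hypothetical toy data
mu = {"x1": 0.5, "x2": 0.3, "x3": 0.2}
gamma = {
    "x1": frozenset({"s1"}),
    "x2": frozenset({"s1", "s2"}),
    "x3": frozenset(),
}
M = induced_measure(mu, gamma)
```

The point `x3` is mapped to the empty set, so its probability 0.2 is removed and the remaining masses are rescaled by 1/0.8.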


9.3.2 Derivation of the DS combination rule 


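Given $n=2$ independent information sources, then from equation (9.8), we have


$$\mu(X_{\delta_1\delta_2\ldots\delta_n})=\sum_{\Gamma X_{\delta_1\delta_2\ldots\delta_n}=\Gamma^{(1)}X^{(1)}_{\delta'_1\delta'_2\ldots\delta'_n}\cap\,\Gamma^{(2)}X^{(2)}_{\delta''_1\delta''_2\ldots\delta''_n}}\mu^{(1)}\bigl(X^{(1)}_{\delta'_1\delta'_2\ldots\delta'_n}\bigr)\,\mu^{(2)}\bigl(X^{(2)}_{\delta''_1\delta''_2\ldots\delta''_n}\bigr) \tag{9.27}$$


From equation (9.26), if $S_{\delta_1\delta_2\ldots\delta_n}\neq\emptyset$, we have for $i=1,2$
$$\mu^{(i)}\bigl(X^{(i)}_{\delta_1\delta_2\ldots\delta_n}\bigr)=M^{(i)}(S_{\delta_1\delta_2\ldots\delta_n})\bigl(1-\mu^{(i)}(X^{(i)}_{00\ldots0})\bigr) \tag{9.28}$$


and


$$\mu(X_{\delta_1\delta_2\ldots\delta_n})=M(S_{\delta_1\delta_2\ldots\delta_n})\bigl(1-\mu(X_{00\ldots0})\bigr) \tag{9.29}$$


where equations (9.28) and (9.29) correspond to information source $i$ ($i=1,2$) and their combined information sources, respectively. Substituting equations (9.28)-(9.29) into equation (9.27), we have


$$M(S_{\delta_1\delta_2\ldots\delta_n})=\frac{\displaystyle\sum_{S_{\delta'_1\ldots\delta'_n}\cap S_{\delta''_1\ldots\delta''_n}=S_{\delta_1\ldots\delta_n}}M^{(1)}(S_{\delta'_1\ldots\delta'_n})\,M^{(2)}(S_{\delta''_1\ldots\delta''_n})\,\bigl[1-\mu^{(1)}(X^{(1)}_{00\ldots0})\bigr]\bigl[1-\mu^{(2)}(X^{(2)}_{00\ldots0})\bigr]}{1-\mu(X_{00\ldots0})} \tag{9.30}$$
and
$$\frac{\bigl[1-\mu^{(1)}(X^{(1)}_{00\ldots0})\bigr]\bigl[1-\mu^{(2)}(X^{(2)}_{00\ldots0})\bigr]}{1-\mu(X_{00\ldots0})}=\frac{\bigl[1-\mu^{(1)}(X^{(1)}_{00\ldots0})\bigr]\bigl[1-\mu^{(2)}(X^{(2)}_{00\ldots0})\bigr]}{\displaystyle\sum_{\Gamma^{(1)}X^{(1)}_{\delta'_1\ldots\delta'_n}\cap\,\Gamma^{(2)}X^{(2)}_{\delta''_1\ldots\delta''_n}\neq\emptyset}\mu^{(1)}\bigl(X^{(1)}_{\delta'_1\ldots\delta'_n}\bigr)\,\mu^{(2)}\bigl(X^{(2)}_{\delta''_1\ldots\delta''_n}\bigr)}=\frac{1}{\displaystyle\sum_{S_{\delta'_1\ldots\delta'_n}\cap S_{\delta''_1\ldots\delta''_n}\neq\emptyset}M^{(1)}(S_{\delta'_1\ldots\delta'_n})\,M^{(2)}(S_{\delta''_1\ldots\delta''_n})} \tag{9.31}$$
Substituting (9.31) back into (9.30), we have
$$M(S_{\delta_1\delta_2\ldots\delta_n})=\frac{\displaystyle\sum_{S_{\delta'_1\ldots\delta'_n}\cap S_{\delta''_1\ldots\delta''_n}=S_{\delta_1\ldots\delta_n}}M^{(1)}(S_{\delta'_1\ldots\delta'_n})\,M^{(2)}(S_{\delta''_1\ldots\delta''_n})}{1-\displaystyle\sum_{S_{\delta'_1\ldots\delta'_n}\cap S_{\delta''_1\ldots\delta''_n}=\emptyset}M^{(1)}(S_{\delta'_1\ldots\delta'_n})\,M^{(2)}(S_{\delta''_1\ldots\delta''_n})} \tag{9.32}$$
and, when $S_{\delta_1\delta_2\ldots\delta_n}=\emptyset$,
$$M(S_{\delta_1\delta_2\ldots\delta_n})=0 \tag{9.33}$$


Thus, equations (9.32) and (9.33) are the DS combination rule, where the space $S=\{s_1,s_2,\ldots,s_n\}$ is the frame of discernment.

The physical meaning of equations (9.8) and (9.32)-(9.33) is different. Equation (9.8) indicates the probability judgement combination in the combination space $(X,\Omega,\mu)$ of $n$ independent information sources, while equations (9.32)-(9.33) denote the probability judgement combination in the mapping space $(S,2^S,M)$ of $n$ independent information sources. The mappings $\Gamma$ and $\Gamma_i$ ($i=1,2,\ldots,n$) relate equations (9.8) and (9.32)-(9.33). This shows the difference between Dempster’s rule and DS combination rule.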


9.3.3 New explanations for the problems in DS combination rule 


From the above derivation, it can be seen that DS combination rule is mathematically based on the product combination rule of multiple independent information sources, as evident from equations (9.1)-(9.3). For each of the elements in the space, the combination probability judgement of independent information sources is the result of the simultaneous probability judgement of each independent information source. That is, if each information source yields simultaneously its probability judgement for the element, then the combination probability judgement for the element can be obtained by DS combination rule, regardless of the magnitude of the judgement probability of each information source. Otherwise, it is the opposite. This gives rise to the following problems:


1. The counter-intuitive results 


Suppose a frame of discernment is $S=\{s_1,s_2,s_3\}$, and the probability judgments of two independent information sources $(X_i,\Omega_i,\mu_i)$, $i=1,2$, are $m_1$ and $m_2$, respectively. That is:


$(X_1,\Omega_1,\mu_1)$: $m_1(s_1)=0.99$, $m_1(s_2)=0.01$


and


$(X_2,\Omega_2,\mu_2)$: $m_2(s_2)=0.01$, $m_2(s_3)=0.99$

Using DS rule to combine the above two independent probability judgements results in
$$m(s_2)=1,\quad m(s_1)=m(s_3)=0 \tag{9.34}$$


This is counter-intuitive. The information source $(X_1,\Omega_1,\mu_1)$ judges $s_1$ with a very large probability measure, 0.99, and judges $s_2$ with a very small probability measure, 0.01, while the information source $(X_2,\Omega_2,\mu_2)$ judges $s_3$ with a very large probability measure, 0.99, and judges $s_2$ with a very small probability measure, 0.01. However, the result of DS combination rule is that $s_2$ occurs with probability measure 1, and the others occur with zero probability measure. The reason for this result is that the two information sources simultaneously give their judgement only for the element $s_2$ of the space $S=\{s_1,s_2,s_3\}$, although the probability measures from the two information sources for that element are very small, each equal to 0.01. The elements $s_1$ and $s_3$ are not judged by the two information sources simultaneously. According to the product combination rule, the result in equation (9.34) is as expected.


It should be pointed out that this counter-intuitive result is not completely due to the normalization factor in highly conflicting evidence [17] [18] [19] of DS combination rule. This can be proven by the following example.


Suppose that, for the above frame of discernment, the probability judgments of another two independent information sources $(X_i,\Omega_i,\mu_i)$, $i=3,4$, are $m_3$ and $m_4$. That is:

$(X_3,\Omega_3,\mu_3)$: $m_3(s_1)=0.99$, $m_3(S)=0.01$


and


$(X_4,\Omega_4,\mu_4)$: $m_4(s_3)=0.99$, $m_4(S)=0.01$


The result of DS combination rule is
$$m'(s_1)=0.4975,\quad m'(s_3)=0.4975,\quad m'(S)=0.0050$$


This result is very different from that in equation (9.34), although the independent probability judgements of the two information sources are also very conflicting for elements $s_1$ and $s_3$. That is, the information source $(X_3,\Omega_3,\mu_3)$ judges $s_1$ with a very large probability measure, 0.99, and judges $S$ with a very small probability measure, 0.01, while the information source $(X_4,\Omega_4,\mu_4)$ judges $s_3$ with a very large probability measure, 0.99, and judges $S$ with a very small probability measure, 0.01.


This is due to the fact that the common element $S=\{s_1,s_2,s_3\}$ of the two information sources includes the elements $s_1$ and $s_3$. So, the element $s_1$ in the information source $(X_3,\Omega_3,\mu_3)$ and the element $S=\{s_1,s_2,s_3\}$ in the information source $(X_4,\Omega_4,\mu_4)$ have the same information, and the element $S=\{s_1,s_2,s_3\}$ in the information source $(X_3,\Omega_3,\mu_3)$ and the element $s_3$ in the information source $(X_4,\Omega_4,\mu_4)$ have the same information. Thus, the two independent information sources can simultaneously give information for the same probability judgement element $S=\{s_1,s_2,s_3\}$, and also simultaneously yield the information for the conflicting elements $s_1$ and $s_3$, respectively. That is required by the product combination rule.


2. The combination of Bayesian (sensitive) information sources


If two Bayesian information sources cannot yield the information about any element of the frame of discernment simultaneously, then the two Bayesian information sources cannot be combined by DS combination rule. For example, there are two Bayesian information sources $(X_1,\Omega_1,\mu_1)$ and $(X_2,\Omega_2,\mu_2)$ over the frame of discernment $S=\{s_1,s_2,s_3,s_4\}$, and the basic probability assignments are, respectively,

$(X_1,\Omega_1,\mu_1)$: $m_1(s_1)=0.4$, $m_1(s_2)=0.6$


and


$(X_2,\Omega_2,\mu_2)$: $m_2(s_3)=0.8$, $m_2(s_4)=0.2$


then their DS combination rule is

$$m(s_1)=m(s_2)=m(s_3)=m(s_4)=0$$

This indicates that every element of the frame of discernment occurs with zero basic probability after DS combination rule is applied. This is a conflict. This is because the source $(X_1,\Omega_1,\mu_1)$ gives probability judgements for elements $s_1$ and $s_2$ of the frame of discernment $S=\{s_1,s_2,s_3,s_4\}$, while the source $(X_2,\Omega_2,\mu_2)$ gives probability judgements for elements $s_3$ and $s_4$ of the same frame. The two sources cannot simultaneously give probability judgements for any element of the frame of discernment $S=\{s_1,s_2,s_3,s_4\}$. Thus, the product combination rule does not work for this case.
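
The three numerical examples of this section can be reproduced with a short sketch of the DS rule. The function `ds_combine` below is a hypothetical helper written for this illustration, with masses stored in dictionaries keyed by `frozenset`. It raises an error when no pair of focal elements intersects. This is the situation the text records as zero mass on every element.

```python
def ds_combine(m1, m2):
    """DS combination of two basic belief assignments (normalized conjunctive rule)."""
    raw, conflict = {}, 0.0
    for A, a in m1.items():
        for B, b in m2.items():
            C = A & B
            if C:
                raw[C] = raw.get(C, 0.0) + a * b
            else:
                conflict += a * b
    if not raw:
        raise ValueError("no common focal element: the DS rule is undefined")
    return {C: v / (1.0 - conflict) for C, v in raw.items()}


s1, s2, s3, s4 = (frozenset({name}) for name in ("s1", "s2", "s3", "s4"))
S = s1 | s2 | s3

# Zadeh-type example: the only common focal element is s2.
zadeh = ds_combine({s1: 0.99, s2: 0.01}, {s2: 0.01, s3: 0.99})

# Second example: the small masses are placed on the whole frame S instead.
softened = ds_combine({s1: 0.99, S: 0.01}, {s3: 0.99, S: 0.01})

# Bayesian sources with disjoint supports: nothing can be combined.
try:
    ds_combine({s1: 0.4, s2: 0.6}, {s3: 0.8, s4: 0.2})
    total_conflict = False
except ValueError:
    total_conflict = True
```

In the second example the normalization constant is $1-0.9801=0.0199$, which gives $0.0099/0.0199\approx0.4975$ for $s_1$ and $s_3$ and $0.0001/0.0199\approx0.0050$ for $S$, in agreement with the values quoted above.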


Based on the above analysis, a possible solution to the problem is to relax the conditions used in the product combination rule (equations (9.1)-(9.4)) for practical applications, and establish a new theory for combining information of multiple sources (see sections 9.4 and 9.5).


9.3.4 Remark about “multi-valued mapping” in Shafer’s paper 


On page 331 of the paper in which G. Shafer explains the concept of multi-valued mappings of DS combination rule, Dempster’s rule is considered as a combination of beliefs, $\mathrm{Bel}(T)=P\{x\,|\,\Gamma(x)\subset T,\ \forall T\subset S\}$. The following proof shows this is incorrect.


Proof: Given two independent information sources, equations (9.1)-(9.4) become:


$$X=X_1\times X_2 \tag{9.35}$$
$$\Omega=\Omega_1\times\Omega_2 \tag{9.36}$$
$$\mu=\mu_1\times\mu_2 \tag{9.37}$$


$$\Gamma x=\Gamma_1 x\cap\Gamma_2 x \tag{9.38}$$

then


$$\mathrm{Bel}(T)\neq\mathrm{Bel}_1(T)\oplus\mathrm{Bel}_2(T)$$


in fact, $\forall T\subset S$,
$$\{\Gamma(x)\subset T\}\nRightarrow\{\Gamma(x_1)\subset T\}\cap\{\Gamma(x_2)\subset T\}$$


hence,


$$\{x\in X\,|\,\Gamma(x)\subset T\}\neq\{x_1\in X_1\,|\,\Gamma(x_1)\subset T\}\times\{x_2\in X_2\,|\,\Gamma(x_2)\subset T\}$$


i.e. the product combination rule in equations (9.35)-(9.38) is not satisfied by the defined belief $\mathrm{Bel}(T)=P\{x\,|\,\Gamma(x)\subset T,\ \forall T\subset S\}$. Therefore, the combination belief cannot be obtained from equations (9.35)-(9.38) with the belief $\mathrm{Bel}(T)=P\{x\,|\,\Gamma(x)\subset T,\ \forall T\subset S\}$. When we examine the product combination rule in equations (9.1)-(9.4), it is known that the combination rule is neither for upper probabilities, nor for lower probabilities (belief), nor for probabilities of the type $p_{\delta_1\delta_2\ldots\delta_n}=\mu(X_{\delta_1\delta_2\ldots\delta_n})$ [1]. It is simply for probability spaces of multiple independent information sources with multi-valued mappings.


9.4 A new combination rule of probability measures over mapping space


It has been demonstrated in section 9.3 that DS combination rule is mathematically based on the product combination rule of multiple independent information sources. The combination probability judgment of n independent information sources for each element is the result of the simultaneous probability judgment of each independent information source. That is, if each information source yields simultaneously its probability judgment for the element, then the combination probability judgment for the element can be obtained by DS combination rule regardless of the magnitude of the judgment probability of each information source. Otherwise, such results are not plausible. This is the main reason that led to the counter-intuitive results in [19]. We will redefine the combination-mapping rule $\Gamma$ using n independent mappings $\Gamma_i$, $i = 1, 2, \ldots, n$, in order to relax the original definition given in section 9.2.1. The combination of probabilities of type $p_{\delta_1\delta_2\ldots\delta_n}$ in the product space $(X, \Omega, \mu)$ will then be realized, and, furthermore, the combination rule of multiple sources of information over mapping space S will also be established.


9.4.1 Derivation of combination rule of probabilities $p^{(i)}_{\delta_1\delta_2\ldots\delta_n}$


Define a new combination-mapping rule for multiple multi-valued mappings as

$$\Gamma x = \Gamma_1 x_1 \cup \Gamma_2 x_2 \cup \ldots \cup \Gamma_n x_n \tag{9.39}$$

It shows that $x_i \in X_i$ is consistent with a particular $s \in S$ if and only if $s \in \Gamma_i x_i$, for $i = 1, 2, \ldots, n$, and consequently $x = \{x_1, x_2, \ldots, x_n\} \in X$ is consistent with that s if and only if there exists some $i \in \{1, 2, \ldots, n\}$ such that $s \in \Gamma_i x_i$.


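For any $T \subset S$, we construct sets

$$\bar{T} = \{x \in X, \ \Gamma x \subset T\} \tag{9.40}$$
$$\bar{T}_i = \{x_i \in X_i, \ \Gamma_i x_i \subset T\} \tag{9.41}$$

and let

$$\lambda(T) = \mu(\bar{T}) \tag{9.42}$$
$$\lambda^{(i)}(T) = \mu_i(\bar{T}_i) \tag{9.43}$$

Hence,

$$\bar{T} = \bar{T}_1 \times \bar{T}_2 \times \ldots \times \bar{T}_n \tag{9.44}$$

and

$$\lambda(T) = \lambda^{(1)}(T) \times \lambda^{(2)}(T) \times \ldots \times \lambda^{(n)}(T) \tag{9.45}$$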
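The factorization in (9.44) can be checked by brute force on a small case. The following is only a sketch: the two sources, their code names (`a`, `b`, `u`, `v`) and their images are hypothetical toy data, not taken from the chapter. It builds the union mapping of (9.39) and compares the set of (9.40) with the product of the per-source sets of (9.41) for every subset T of S.

```python
from itertools import product

# Hypothetical toy data: each source maps its codes to subsets of S.
S = frozenset({"s1", "s2", "s3"})
gamma1 = {"a": frozenset({"s1"}), "b": frozenset({"s1", "s2"})}
gamma2 = {"u": frozenset({"s2"}), "v": frozenset({"s3"})}

def gamma(x):
    """Combined mapping of equation (9.39): union of the individual images."""
    x1, x2 = x
    return gamma1[x1] | gamma2[x2]

def t_bar(T):
    """Equation (9.40): all pairs x whose combined image lies inside T."""
    return {x for x in product(gamma1, gamma2) if gamma(x) <= T}

def t_bar_product(T):
    """Right-hand side of (9.44): product of the per-source sets of (9.41)."""
    t1 = [x1 for x1 in gamma1 if gamma1[x1] <= T]
    t2 = [x2 for x2 in gamma2 if gamma2[x2] <= T]
    return set(product(t1, t2))

def all_subsets(s):
    """Enumerate every subset of s as a frozenset."""
    items = sorted(s)
    for mask in range(1 << len(items)):
        yield frozenset(e for k, e in enumerate(items) if mask >> k & 1)

# True when (9.44) holds for every T on this toy example.
check_944 = all(t_bar(T) == t_bar_product(T) for T in all_subsets(S))
print(check_944)
```

The check succeeds because a union lies inside T exactly when each of its parts does, which is the step that turns (9.40) into the product (9.44).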


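Consider a finite $S = \{s_1, s_2, s_3\}$ and two independent sources of information characterized by $p^{(i)}_{000}$, $p^{(i)}_{100}$, $p^{(i)}_{010}$, $p^{(i)}_{001}$, $p^{(i)}_{110}$, $p^{(i)}_{101}$, $p^{(i)}_{011}$ and $p^{(i)}_{111}$, $i = 1, 2$. Suppose $\lambda^{(i)}(T)$, $(i = 1, 2)$, corresponding to $T = \emptyset$, $\{s_1\}$, $\{s_2\}$, $\{s_3\}$, $\{s_1, s_2\}$, $\{s_2, s_3\}$, $\{s_1, s_3\}$, $\{s_1, s_2, s_3\}$ is expressed as $\lambda^{(i)}_{000}$, $\lambda^{(i)}_{100}$, $\lambda^{(i)}_{010}$, $\lambda^{(i)}_{001}$, $\lambda^{(i)}_{110}$, $\lambda^{(i)}_{011}$, $\lambda^{(i)}_{101}$ and $\lambda^{(i)}_{111}$, $i = 1, 2$. Then for $i = 1, 2$,

$$\lambda^{(i)}_{000} = p^{(i)}_{000} \tag{9.46}$$
$$\lambda^{(i)}_{100} = p^{(i)}_{000} + p^{(i)}_{100} \tag{9.47}$$
$$\lambda^{(i)}_{010} = p^{(i)}_{000} + p^{(i)}_{010} \tag{9.48}$$
$$\lambda^{(i)}_{001} = p^{(i)}_{000} + p^{(i)}_{001} \tag{9.49}$$
$$\lambda^{(i)}_{110} = p^{(i)}_{000} + p^{(i)}_{100} + p^{(i)}_{010} + p^{(i)}_{110} \tag{9.50}$$
$$\lambda^{(i)}_{011} = p^{(i)}_{000} + p^{(i)}_{010} + p^{(i)}_{001} + p^{(i)}_{011} \tag{9.51}$$
$$\lambda^{(i)}_{101} = p^{(i)}_{000} + p^{(i)}_{100} + p^{(i)}_{001} + p^{(i)}_{101} \tag{9.52}$$
$$\lambda^{(i)}_{111} = p^{(i)}_{000} + p^{(i)}_{100} + p^{(i)}_{010} + p^{(i)}_{001} + p^{(i)}_{110} + p^{(i)}_{101} + p^{(i)}_{011} + p^{(i)}_{111} \tag{9.53}$$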








If $M_{\delta_1\delta_2\delta_3}$ and $p_{\delta_1\delta_2\delta_3}$ ($\delta_i = 1$ or $0$, $i = 1, 2, 3$) are used to express the combined probability measure of two independent sources of information in spaces $S = \{s_1, s_2, s_3\}$ and $(X, \Omega, \mu)$, respectively, then based on equation (9.45) and through equations (9.46)-(9.53), the following can be obtained
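$$p_{000} = p^{(1)}_{000} p^{(2)}_{000} \tag{9.54}$$
$$p_{100} = p^{(1)}_{100} p^{(2)}_{000} + p^{(1)}_{000} p^{(2)}_{100} + p^{(1)}_{100} p^{(2)}_{100} \tag{9.55}$$
$$p_{010} = p^{(1)}_{010} p^{(2)}_{000} + p^{(1)}_{000} p^{(2)}_{010} + p^{(1)}_{010} p^{(2)}_{010} \tag{9.56}$$
$$p_{001} = p^{(1)}_{001} p^{(2)}_{000} + p^{(1)}_{000} p^{(2)}_{001} + p^{(1)}_{001} p^{(2)}_{001} \tag{9.57}$$

$$\begin{aligned}
p_{110} = {} & p^{(1)}_{000} p^{(2)}_{110} + p^{(1)}_{100} p^{(2)}_{010} + p^{(1)}_{100} p^{(2)}_{110} + p^{(1)}_{010} p^{(2)}_{100} \\
& + p^{(1)}_{010} p^{(2)}_{110} + p^{(1)}_{110} p^{(2)}_{000} + p^{(1)}_{110} p^{(2)}_{100} + p^{(1)}_{110} p^{(2)}_{010} + p^{(1)}_{110} p^{(2)}_{110}
\end{aligned} \tag{9.58}$$

$$\begin{aligned}
p_{101} = {} & p^{(1)}_{000} p^{(2)}_{101} + p^{(1)}_{100} p^{(2)}_{001} + p^{(1)}_{100} p^{(2)}_{101} + p^{(1)}_{001} p^{(2)}_{100} \\
& + p^{(1)}_{001} p^{(2)}_{101} + p^{(1)}_{101} p^{(2)}_{000} + p^{(1)}_{101} p^{(2)}_{100} + p^{(1)}_{101} p^{(2)}_{001} + p^{(1)}_{101} p^{(2)}_{101}
\end{aligned} \tag{9.59}$$

$$\begin{aligned}
p_{011} = {} & p^{(1)}_{000} p^{(2)}_{011} + p^{(1)}_{010} p^{(2)}_{001} + p^{(1)}_{010} p^{(2)}_{011} + p^{(1)}_{001} p^{(2)}_{010} \\
& + p^{(1)}_{001} p^{(2)}_{011} + p^{(1)}_{011} p^{(2)}_{000} + p^{(1)}_{011} p^{(2)}_{010} + p^{(1)}_{011} p^{(2)}_{001} + p^{(1)}_{011} p^{(2)}_{011}
\end{aligned} \tag{9.60}$$

$$\begin{aligned}
p_{111} = {} & p^{(1)}_{000} p^{(2)}_{111} + p^{(1)}_{100} p^{(2)}_{011} + p^{(1)}_{100} p^{(2)}_{111} + p^{(1)}_{010} p^{(2)}_{101} \\
& + p^{(1)}_{010} p^{(2)}_{111} + p^{(1)}_{001} p^{(2)}_{110} + p^{(1)}_{001} p^{(2)}_{111} + p^{(1)}_{110} p^{(2)}_{001} + p^{(1)}_{110} p^{(2)}_{101} \\
& + p^{(1)}_{110} p^{(2)}_{011} + p^{(1)}_{110} p^{(2)}_{111} + p^{(1)}_{101} p^{(2)}_{010} + p^{(1)}_{101} p^{(2)}_{110} + p^{(1)}_{101} p^{(2)}_{011} \\
& + p^{(1)}_{101} p^{(2)}_{111} + p^{(1)}_{011} p^{(2)}_{100} + p^{(1)}_{011} p^{(2)}_{110} + p^{(1)}_{011} p^{(2)}_{101} + p^{(1)}_{011} p^{(2)}_{111} \\
& + p^{(1)}_{111} p^{(2)}_{000} + p^{(1)}_{111} p^{(2)}_{100} + p^{(1)}_{111} p^{(2)}_{010} + p^{(1)}_{111} p^{(2)}_{001} + p^{(1)}_{111} p^{(2)}_{011} \\
& + p^{(1)}_{111} p^{(2)}_{110} + p^{(1)}_{111} p^{(2)}_{101} + p^{(1)}_{111} p^{(2)}_{111}
\end{aligned} \tag{9.61}$$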

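For the case of $S = \{s_1, s_2, \ldots, s_n\}$, the general combination rule is

$$p_{\delta_1\delta_2\ldots\delta_n} = \sum_{\substack{\delta_i = \delta'_i \cup \delta''_i \\ i = 1, 2, \ldots, n}} p^{(1)}_{\delta'_1\delta'_2\ldots\delta'_n} \, p^{(2)}_{\delta''_1\delta''_2\ldots\delta''_n} \tag{9.62}$$

for all $(\delta'_1, \delta'_2, \ldots, \delta'_n, \delta''_1, \delta''_2, \ldots, \delta''_n)$.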
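The general rule (9.62) is straightforward to compute once each index $\delta_1\delta_2\ldots\delta_n$ is stored as a tuple of 0/1 flags. The sketch below makes that assumption; the function name `combine_union` and the numerical values of the two sources are hypothetical and serve only as illustration.

```python
from itertools import product

def combine_union(p1, p2):
    """Equation (9.62): p[d] sums p1[d1] * p2[d2] over all d1, d2 with d = d1 OR d2."""
    out = {}
    for (d1, a), (d2, b) in product(p1.items(), p2.items()):
        d = tuple(x | y for x, y in zip(d1, d2))  # element-wise union of the flags
        out[d] = out.get(d, 0.0) + a * b
    return out

# Hypothetical probabilities for two sources over S = {s1, s2, s3}.
p1 = {(0, 0, 0): 0.1, (1, 0, 0): 0.4, (0, 1, 0): 0.2, (1, 1, 1): 0.3}
p2 = {(0, 0, 0): 0.2, (1, 0, 0): 0.3, (0, 0, 1): 0.5}

p = combine_union(p1, p2)
for index, value in sorted(p.items()):
    print(index, round(value, 4))
```

With these values, the entry for index (1, 0, 0) collects exactly the three products of equation (9.55), and the combined values still sum to one because every pair of indices contributes to exactly one union.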


9.4.2 Combination rule of probability measures in space S 


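Define a probability measure over $2^S = \{S_{\delta_1\delta_2\ldots\delta_n}\}$ as $M : 2^S = \{S_{\delta_1\delta_2\ldots\delta_n}\} \to [0, 1]$ with

$$M(S_{\delta_1\delta_2\ldots\delta_n}) = \begin{cases} 0, & S_{\delta_1\delta_2\ldots\delta_n} = S_{00\ldots0} \\[4pt] \dfrac{\mu(X_{\delta_1\delta_2\ldots\delta_n})}{1 - \mu(X_{00\ldots0})}, & S_{\delta_1\delta_2\ldots\delta_n} \neq S_{00\ldots0} \end{cases} \tag{9.63}$$

where M is the probability measure over the class $2^S = \{S_{\delta_1\delta_2\ldots\delta_n}\}$ of subsets of space S and $\Gamma$ maps X into S.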
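The combination rule:

Given two independent sources of information $(X_i, \Omega_i, \mu_i)$, $i = 1, 2$, and the corresponding mapping space $S = \{s_1, s_2, \ldots, s_n\} = \{S_{\delta_1\delta_2\ldots\delta_n}\}$, where $\Gamma_i$ maps $X_i$ into S. Based on equation (9.62), we have

$$\mu(X_{\delta_1\delta_2\ldots\delta_n}) = \sum_{\substack{\delta_i = \delta'_i \cup \delta''_i \\ i = 1, 2, \ldots, n}} \mu^{(1)}(X_{\delta'_1\delta'_2\ldots\delta'_n}) \, \mu^{(2)}(X_{\delta''_1\delta''_2\ldots\delta''_n}) \tag{9.64}$$

From equation (9.63), for any $S_{\delta_1\delta_2\ldots\delta_n} \neq S_{00\ldots0}$, we have

$$\mu^{(1)}(X_{\delta'_1\delta'_2\ldots\delta'_n}) = M^{(1)}(S_{\delta'_1\delta'_2\ldots\delta'_n}) \left(1 - \mu^{(1)}(X_{00\ldots0})\right) \tag{9.65}$$
$$\mu^{(2)}(X_{\delta''_1\delta''_2\ldots\delta''_n}) = M^{(2)}(S_{\delta''_1\delta''_2\ldots\delta''_n}) \left(1 - \mu^{(2)}(X_{00\ldots0})\right) \tag{9.66}$$

and

$$\mu(X_{\delta_1\delta_2\ldots\delta_n}) = M(S_{\delta_1\delta_2\ldots\delta_n}) \left(1 - \mu(X_{00\ldots0})\right) \tag{9.67}$$

such that equation (9.64) becomes

$$M(S_{\delta_1\delta_2\ldots\delta_n}) = \frac{\left[1 - \mu^{(1)}(X_{00\ldots0})\right]\left[1 - \mu^{(2)}(X_{00\ldots0})\right]}{1 - \mu(X_{00\ldots0})} \sum_{\substack{\delta_i = \delta'_i \cup \delta''_i \\ i = 1, 2, \ldots, n}} M^{(1)}(S_{\delta'_1\delta'_2\ldots\delta'_n}) \, M^{(2)}(S_{\delta''_1\delta''_2\ldots\delta''_n}) \tag{9.68}$$

and

$$\frac{\left[1 - \mu^{(1)}(X_{00\ldots0})\right]\left[1 - \mu^{(2)}(X_{00\ldots0})\right]}{1 - \mu(X_{00\ldots0})} = \frac{1}{\displaystyle\sum_{\substack{\delta'_i \cup \delta''_i \neq 0 \\ i = 1, 2, \ldots, n}} M^{(1)}(S_{\delta'_1\delta'_2\ldots\delta'_n}) \, M^{(2)}(S_{\delta''_1\delta''_2\ldots\delta''_n})} \tag{9.69}$$

Substitute (9.69) into (9.68),

$$\begin{aligned}
M(S_{\delta_1\delta_2\ldots\delta_n}) &= \frac{\displaystyle\sum_{\substack{\delta_i = \delta'_i \cup \delta''_i \\ i = 1, 2, \ldots, n}} M^{(1)}(S_{\delta'_1\delta'_2\ldots\delta'_n}) \, M^{(2)}(S_{\delta''_1\delta''_2\ldots\delta''_n})}{1 - \displaystyle\sum_{\substack{\delta'_i \cup \delta''_i = 0 \\ i = 1, 2, \ldots, n}} M^{(1)}(S_{\delta'_1\delta'_2\ldots\delta'_n}) \, M^{(2)}(S_{\delta''_1\delta''_2\ldots\delta''_n})} \\
&= \sum_{\substack{\delta_i = \delta'_i \cup \delta''_i \\ i = 1, 2, \ldots, n}} M^{(1)}(S_{\delta'_1\delta'_2\ldots\delta'_n}) \, M^{(2)}(S_{\delta''_1\delta''_2\ldots\delta''_n})
\end{aligned} \tag{9.70}$$

If $S_{\delta_1\delta_2\ldots\delta_n} = S_{00\ldots0}$, we define

$$M(S_{\delta_1\delta_2\ldots\delta_n}) \triangleq 0 \tag{9.71}$$

Hence, equations (9.70)-(9.71) express the combination of two sources of information, $(X_i, \Omega_i, \mu_i)$, $i = 1, 2$, for the mapping space $S = \{s_1, s_2, \ldots, s_n\} = \{S_{\delta_1\delta_2\ldots\delta_n}\}$, where $\Gamma_i$ maps $X_i$ into S.
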
9.5 The disjunctive combination rule


Based on the results in section 9.4, the disjunctive combination rule for two independent sources of information is obtained as follows:


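Theorem 4:

Suppose $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$ is a frame of discernment with n elements. The basic probability assignments of the two sources of information, $(X_1, \Omega_1, \mu_1)$ and $(X_2, \Omega_2, \mu_2)$, over the same frame of discernment are $m_1$ and $m_2$, with focal elements $A_1, A_2, \ldots, A_k$ and $B_1, B_2, \ldots, B_l$, respectively. Then the combined basic probability assignment of the two sources of information can be defined as

$$m(C) = \begin{cases} \displaystyle\sum_{C = A_i \cup B_j} m_1(A_i) m_2(B_j), & C \neq \emptyset \\ 0, & C = \emptyset \end{cases} \tag{9.72}$$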


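Proof: Since $m(\emptyset) = 0$ by definition, m is a basic probability assignment provided only that the m(C) sum to one. In fact,

$$\begin{aligned}
\sum_{C \subseteq \Theta} m(C) &= m(\emptyset) + \sum_{\substack{C \subseteq \Theta \\ C \neq \emptyset}} m(C) \\
&= \sum_{\substack{C \subseteq \Theta \\ C \neq \emptyset}} \ \sum_{\substack{C = A_i \cup B_j \\ i \in \{1, 2, \ldots, k\}, \ j \in \{1, 2, \ldots, l\}}} m_1(A_i) m_2(B_j) \\
&= \sum_{\substack{A_i \cup B_j \neq \emptyset \\ i \in \{1, 2, \ldots, k\}, \ j \in \{1, 2, \ldots, l\}}} m_1(A_i) m_2(B_j) \\
&= \sum_{A_i \subseteq \Theta} m_1(A_i) \sum_{B_j \subseteq \Theta} m_2(B_j) = 1
\end{aligned}$$

Hence, m is a basic probability assignment over the frame of discernment $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$. Its focal elements are

$$C = A_i \cup B_j, \quad i = 1, 2, \ldots, k, \quad j = 1, 2, \ldots, l$$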


Based on theorem 4, theorem 5 can be stated as follows. A similar result can be found in [29] BY]. 


Theorem 5:

If $Bel_1$ and $Bel_2$ are belief functions over the same frame of discernment $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$ with basic probability assignments $m_1$ and $m_2$, and focal elements $A_1, A_2, \ldots, A_k$ and $B_1, B_2, \ldots, B_l$, respectively, then the function $m : 2^\Theta \to [0, 1]$ defined as

$$m(C) = \begin{cases} 0, & C = \emptyset \\ \displaystyle\sum_{C = A_i \cup B_j} m_1(A_i) m_2(B_j), & C \neq \emptyset \end{cases} \tag{9.73}$$

yields a basic probability assignment. The core of the belief function given by m is equal to the union of the cores of $Bel_1$ and $Bel_2$.
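The disjunctive rule of theorems 4 and 5 can be sketched in a few lines if focal elements are stored as `frozenset` objects. The function names and the two basic probability assignments below are hypothetical choices for illustration; the code checks the two claims of the theorems on that example, namely that the combined masses sum to one and that the combined core is the union of the two cores.

```python
from itertools import product

def disjunctive(m1, m2):
    """Equation (9.73): the product m1(A) * m2(B) is assigned to the union A | B."""
    m = {}
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        c = a | b
        m[c] = m.get(c, 0.0) + x * y
    return m

def core(m):
    """Union of all focal elements, i.e. of the sets with positive mass."""
    return frozenset().union(*(a for a, v in m.items() if v > 0))

# Hypothetical assignments over the frame {a, b, c}.
m1 = {frozenset("a"): 0.6, frozenset("ab"): 0.4}
m2 = {frozenset("b"): 0.3, frozenset("c"): 0.7}

m = disjunctive(m1, m2)
total_mass = sum(m.values())
cores_agree = core(m) == core(m1) | core(m2)
print({"".join(sorted(k)): round(v, 2) for k, v in m.items()}, cores_agree)
```

Because a union of non-empty sets is never empty, no mass is lost and no normalization step is needed, in contrast to the conjunctive rule.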


Physical interpretations of the combination rule for two independent sources of information are:

1. The combination rule in theorem 4 indicates a type of logical inference in human judgments, namely the OR rule. That is, for a given frame of discernment, the elements that are simultaneously judged by each source of information will also be judgment elements of the combined source of information; otherwise, the result is uncertain and the combined judgment of those elements is ignorance.

2. The essential difference between the new combination rule and DS combination rule is that the latter is a type of logical inference with AND, or conjunction, while the former is based on OR, or disjunction. The new combination rule (the OR rule) overcomes the shortcomings of DS combination rule with AND, such as in the counter-intuitive situation and in the combination of sharp sources of information.

3. The judgment with OR has an advantage over that with AND in treating elements that are not simultaneously judged by each independent source of information. The OR rule gives more plausible judgments for these elements than the AND rule, and fits the logical judgment of human beings better.


Example 1

Given the frame of discernment $\Theta = \{\theta_1, \theta_2\}$, the judgments of the basic probability from two sources of information are $m_1$ and $m_2$ as follows:

$$m_1(\theta_1) = 0.2, \quad m_1(\theta_2) = 0.4, \quad m_1(\theta_1, \theta_2) = 0.4$$
$$m_2(\theta_1) = 0.4, \quad m_2(\theta_2) = 0.4, \quad m_2(\theta_1, \theta_2) = 0.2$$

Then through theorem 4, the combination judgment is

$$m(\theta_1) = 0.08, \quad m(\theta_2) = 0.16, \quad m(\theta_1, \theta_2) = 0.76$$

Comparing the combined basic probabilities of $\theta_1$ and $\theta_2$, the judgment of $\theta_2$ receives a larger combined mass than that of $\theta_1$, but the combination as a whole does not decrease the uncertainty of the judgments, as is evident from the above results.
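The three combined values of Example 1 follow directly from theorem 4. The worked arithmetic below is added for convenience; it groups the products by the union they produce and writes $\Theta$ for $\{\theta_1, \theta_2\}$.

```latex
\begin{aligned}
m(\theta_1) &= m_1(\theta_1)\, m_2(\theta_1) = 0.2 \times 0.4 = 0.08 \\
m(\theta_2) &= m_1(\theta_2)\, m_2(\theta_2) = 0.4 \times 0.4 = 0.16 \\
m(\Theta)   &= m_1(\theta_1)\, m_2(\theta_2) + m_1(\theta_2)\, m_2(\theta_1)
              + m_1(\Theta)\,\bigl[m_2(\theta_1) + m_2(\theta_2) + m_2(\Theta)\bigr]
              + \bigl[m_1(\theta_1) + m_1(\theta_2)\bigr]\, m_2(\Theta) \\
            &= 0.08 + 0.16 + 0.40 + 0.12 = 0.76
\end{aligned}
```

The three masses add up to one, and most of the mass ends up on $\Theta$ because every pair of different focal elements contributes to the union.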


Example 2 (the counter-intuitive situation)

Zadeh’s example:

The frame of discernment about the patient is $\Theta = \{M, C, T\}$, where M denotes meningitis, C represents contusion and T indicates tumor. The judgments of two doctors about the patient are

$$m_1(M) = 0.99, \quad m_1(T) = 0.01$$
$$m_2(C) = 0.99, \quad m_2(T) = 0.01$$

Combining these judgments through theorem 4 results in

$$m(M \cup C) = 0.9801, \quad m(M \cup T) = 0.0099, \quad m(C \cup T) = 0.0099, \quad m(T) = 0.0001$$

From $m(M \cup T) = 0.0099$ and $m(C \cup T) = 0.0099$, it is clear that there is little uncertainty between T and M, as well as between T and C, which implies that T can easily be distinguished from M and C. Also, T occurs with the basic probability $m(T) = 0.0001$, i.e. T probably will not occur in the patient. The patient may be affected by M or C. Furthermore, because $m(M \cup C) = 0.9801$, there is a large uncertainty of 0.9801 between M and C, so the two doctors cannot establish whether the patient has meningitis (M) or contusion (C); they can only conclude that the patient most likely has no tumor (T). The patient needs to be examined by more doctors to confirm the diagnosis.


We see that the disjunctive combination rule handles this case very well. It fits human intuitive judgment.
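The numbers of Zadeh’s example under the disjunctive rule can be reproduced with a short sketch. The representation of focal elements as `frozenset` objects and the function name `disjunctive` are assumptions made for this illustration; the masses are those given in the example.

```python
from itertools import product

def disjunctive(m1, m2):
    """Disjunctive rule of theorem 4: m1(A) * m2(B) is assigned to A | B."""
    combined = {}
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        c = a | b
        combined[c] = combined.get(c, 0.0) + x * y
    return combined

# Singletons of the frame {M, C, T}.
M, C, T = (frozenset({name}) for name in "MCT")

m1 = {M: 0.99, T: 0.01}  # first doctor
m2 = {C: 0.99, T: 0.01}  # second doctor

m = disjunctive(m1, m2)
for focal, mass in sorted(m.items(), key=lambda kv: -kv[1]):
    print("".join(sorted(focal)), round(mass, 4))
```

All of the conflicting mass 0.99 x 0.99 is kept on the union of M and C instead of being discarded, which is why the result stays close to what the two doctors actually said.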


9.6 Properties of conjunctive and disjunctive combination rules 


In this section, the conjunctive and disjunctive combination rules, namely Dempster-Shafer’s combination rule, Yager’s combination rule, Dubois and Prade’s (DP) combination rule, DSm’s combination rule and the disjunctive combination rule, are studied. The properties of each combination rule of evidence are discussed in detail, such as the role of the evidence of each source of information in the combination judgment, the comparison of the combination judgment belief and ignorance of each combination rule, the treatment of conflicting judgments given by the two sources of information, and the applications of the combination rules. Zadeh’s example is included in this section to evaluate the performance as well as the efficiency of each combination rule of evidence for conflicting judgments given by the two sources of information.
9.6.1 The combination rules of evidence

9.6.1.1 Yager’s combination rule of evidence

Suppose $Bel_1$ and $Bel_2$ are belief functions over the same frame of discernment $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$ with basic probability assignments $m_1$ and $m_2$, and focal elements $A_1, A_2, \ldots, A_k$ and $B_1, B_2, \ldots, B_l$, respectively. Then Yager’s combined basic probability assignment of the two sources of information can be defined as [20]

$$m_Y(C) = \begin{cases} \displaystyle\sum_{C = A_i \cap B_j} m_1(A_i) m_2(B_j), & C \neq \Theta, \emptyset \\ m_1(\Theta) m_2(\Theta) + \displaystyle\sum_{A_i \cap B_j = \emptyset} m_1(A_i) m_2(B_j), & C = \Theta \\ 0, & C = \emptyset \end{cases} \tag{9.74}$$

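9.6.1.2 Dubois & Prade (DP)’s combination rule of evidence

Given the same conditions as in Yager’s combination rule, Dubois and Prade’s combined basic probability assignment of the two sources of information can be defined as [26]

$$m_{DP}(C) = \begin{cases} \displaystyle\sum_{\substack{C = A_i \cap B_j \\ A_i \cap B_j \neq \emptyset}} m_1(A_i) m_2(B_j) + \sum_{\substack{C = A_i \cup B_j \\ A_i \cap B_j = \emptyset}} m_1(A_i) m_2(B_j), & C \neq \emptyset \\ 0, & C = \emptyset \end{cases} \tag{9.75}$$
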
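Rule (9.75) differs from Yager’s rule only in where the conflicting products are sent. The sketch below assumes `frozenset` focal elements; the function name `dubois_prade` and the two assignments over the frame {a, b, c} are hypothetical examples chosen so that both branches of the rule are exercised.

```python
from itertools import product

def dubois_prade(m1, m2):
    """Equation (9.75): intersect when possible, otherwise fall back to the union."""
    m = {}
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        c = (a & b) or (a | b)   # an empty intersection is falsy, so the union is used
        m[c] = m.get(c, 0.0) + x * y
    return m

# Hypothetical assignments over the frame {a, b, c}.
m1 = {frozenset("a"): 0.6, frozenset("ab"): 0.4}
m2 = {frozenset("b"): 0.3, frozenset("c"): 0.7}

m_dp = dubois_prade(m1, m2)
print({"".join(sorted(k)): round(v, 2) for k, v in m_dp.items()})
```

Here only the pair ({a, b}, {b}) intersects, so its product 0.12 goes to {b}; the three conflicting pairs are assigned to their unions.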
9.6.1.3 DSm combination rules of evidence

These rules are presented in detail in chapters 1 and 4 and are just recalled briefly here for convenience for the two independent sources of information.

• The classical DSm combination rule for free DSm model

$$\forall C \in D^\Theta, \quad m(C) = \sum_{\substack{A, B \in D^\Theta \\ A \cap B = C}} m_1(A) m_2(B) \tag{9.76}$$

where $D^\Theta$ denotes the hyper-power set of the frame $\Theta$ (see chapters 2 and 3 for details).

• The general DSm combination rule for hybrid DSm model $\mathcal{M}$

We consider here only the two sources combination rule.

$$\forall A \in D^\Theta, \quad m_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A) \left[ S_1(A) + S_2(A) + S_3(A) \right] \tag{9.77}$$

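where $\phi(A)$ is the characteristic non-emptiness function of a set A, i.e. $\phi(A) = 1$ if $A \notin \boldsymbol{\emptyset}$ and $\phi(A) = 0$ otherwise, where $\boldsymbol{\emptyset} \triangleq \{\emptyset_{\mathcal{M}}, \emptyset\}$. $\emptyset_{\mathcal{M}}$ is the set of all elements of $D^\Theta$ which have been forced to be empty through the constraints of the model $\mathcal{M}$ and $\emptyset$ is the classical/universal empty set. $S_1(A) \equiv m_{\mathcal{M}^f(\Theta)}(A)$, $S_2(A)$, $S_3(A)$ are defined by (see chapter 4)

$$S_1(A) \triangleq \sum_{\substack{X_1, X_2 \in D^\Theta \\ X_1 \cap X_2 = A}} \ \prod_{i=1}^{2} m_i(X_i) \tag{9.78}$$

$$S_2(A) \triangleq \sum_{\substack{X_1, X_2 \in \boldsymbol{\emptyset} \\ [\mathcal{U} = A] \vee [(\mathcal{U} \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \ \prod_{i=1}^{2} m_i(X_i) \tag{9.79}$$

$$S_3(A) \triangleq \sum_{\substack{X_1, X_2 \in D^\Theta \\ X_1 \cup X_2 = A \\ X_1 \cap X_2 \in \boldsymbol{\emptyset}}} \ \prod_{i=1}^{2} m_i(X_i) \tag{9.80}$$

with $\mathcal{U} \triangleq u(X_1) \cup u(X_2)$, where $u(X)$ is the union of all singletons $\theta_i$ that compose X and $I_t \triangleq \theta_1 \cup \theta_2$ is the total ignorance. $S_1(A)$ corresponds to the classic DSm rule of combination based on the free DSm model; $S_2(A)$ represents the mass of all relatively and absolutely empty sets which is transferred to the total or relative ignorances; $S_3(A)$ transfers the sum of relatively empty sets to the non-empty sets.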
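For a frame with two elements, the classical DSm rule (9.76) can be sketched by encoding each element of $D^\Theta$ as a set of Venn-diagram regions, so that intersection in $D^\Theta$ becomes ordinary set intersection. This encoding, the region names `r1`, `r2`, `r12` and the labels are assumptions of the sketch, and only the free-model term $S_1$ is covered, not the hybrid rule (9.77). The masses are those of Example 1 in section 9.5.

```python
from itertools import product

# Venn regions of the free model: r1 (theta1 only), r2 (theta2 only), r12 (overlap).
TH1 = frozenset({"r1", "r12"})
TH2 = frozenset({"r2", "r12"})
LABELS = {TH1: "t1", TH2: "t2", TH1 & TH2: "t1&t2", TH1 | TH2: "t1|t2"}

def dsm_classic(m1, m2):
    """Equation (9.76): m1(A) * m2(B) is assigned to A & B, which is never empty here."""
    out = {}
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        c = a & b
        out[c] = out.get(c, 0.0) + x * y
    return out

m1 = {TH1: 0.2, TH2: 0.4, TH1 | TH2: 0.4}
m2 = {TH1: 0.4, TH2: 0.4, TH1 | TH2: 0.2}

m = dsm_classic(m1, m2)
named = {LABELS[k]: round(v, 2) for k, v in m.items()}
print(named)
```

On these inputs the products of the two pairs ($\theta_1$, $\theta_2$) and ($\theta_2$, $\theta_1$), 0.08 + 0.16 = 0.24, are kept on $\theta_1 \cap \theta_2$ as a paradoxical element rather than being treated as conflict.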


9.6.1.4 The disjunctive combination rule of evidence

This rule has been presented and justified previously in this chapter and can be found also in [22, 23, 29, 30, 31].

Suppose $\Theta = \{\theta_1, \theta_2, \ldots, \theta_n\}$ is a frame of discernment with n elements (it is the same as in theorem 3). The basic probability assignments of the two sources of information over the same frame of discernment are $m_1$ and $m_2$, with focal elements $A_1, A_2, \ldots, A_k$ and $B_1, B_2, \ldots, B_l$, respectively. Then the combined basic probability assignment of the two sources of information can be defined as

$$m_{Dis}(C) = \begin{cases} \displaystyle\sum_{C = A_i \cup B_j} m_1(A_i) m_2(B_j), & C \neq \emptyset \\ 0, & C = \emptyset \end{cases} \tag{9.81}$$

for any $C \subset \Theta$. The core of the belief function given by m is equal to the union of the cores of $Bel_1$ and $Bel_2$.

9.6.2 Properties of combination rules of evidence

Given two independent sources of information defined over the frame of discernment $\Theta = \{\theta_1, \theta_2\}$, their basic probability assignments or basic belief masses over $\Theta$ are

$$S_1 : \ m_1(\theta_1) = 0.4, \quad m_1(\theta_2) = 0.3, \quad m_1(\theta_1 \cup \theta_2) = 0.3$$
$$S_2 : \ m_2(\theta_1) = 0.5, \quad m_2(\theta_2) = 0.3, \quad m_2(\theta_1 \cup \theta_2) = 0.2$$

Then the results of each combination rule of evidence for the two independent sources of information are as follows. For a frame of discernment with n elements, similar results can be obtained.


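$S_2\ (m_2)$ \ $S_1\ (m_1)$ | $\{\theta_1\}$ (0.4) | $\{\theta_2\}$ (0.3) | $\{\theta_1, \theta_2\}$ (0.3)
$\{\theta_1\}$ (0.5) | $\{\theta_1\}$ (0.2) | $\{\theta_1\} \cap \{\theta_2\} \Rightarrow k$ (0.15) | $\{\theta_1\}$ (0.15)
$\{\theta_2\}$ (0.3) | $\{\theta_1\} \cap \{\theta_2\} \Rightarrow k$ (0.12) | $\{\theta_2\}$ (0.09) | $\{\theta_2\}$ (0.09)
$\{\theta_1, \theta_2\}$ (0.2) | $\{\theta_1\}$ (0.08) | $\{\theta_2\}$ (0.06) | $\{\theta_1, \theta_2\}$ (0.06)

Table 9.1: The conjunctive combination of evidence (DS)

$S_2\ (m_2)$ \ $S_1\ (m_1)$ | $\{\theta_1\}$ (0.4) | $\{\theta_2\}$ (0.3) | $\{\theta_1, \theta_2\}$ (0.3)
$\{\theta_1\}$ (0.5) | $\{\theta_1\}$ (0.2) | $\{\theta_1\} \cap \{\theta_2\} \Rightarrow \Theta$ (0.15) | $\{\theta_1\}$ (0.15)
$\{\theta_2\}$ (0.3) | $\{\theta_1\} \cap \{\theta_2\} \Rightarrow \Theta$ (0.12) | $\{\theta_2\}$ (0.09) | $\{\theta_2\}$ (0.09)
$\{\theta_1, \theta_2\}$ (0.2) | $\{\theta_1\}$ (0.08) | $\{\theta_2\}$ (0.06) | $\{\theta_1, \theta_2\}$ (0.06)

Table 9.2: The conjunctive and disjunctive combination of evidence (Yager)

$S_2\ (m_2)$ \ $S_1\ (m_1)$ | $\{\theta_1\}$ (0.4) | $\{\theta_2\}$ (0.3) | $\{\theta_1, \theta_2\}$ (0.3)
$\{\theta_1\}$ (0.5) | $\{\theta_1\}$ (0.2) | $\{\theta_1\} \cap \{\theta_2\} \Rightarrow \{\theta_1\} \cup \{\theta_2\}$ (0.15) | $\{\theta_1\}$ (0.15)
$\{\theta_2\}$ (0.3) | $\{\theta_1\} \cap \{\theta_2\} \Rightarrow \{\theta_1\} \cup \{\theta_2\}$ (0.12) | $\{\theta_2\}$ (0.09) | $\{\theta_2\}$ (0.09)
$\{\theta_1, \theta_2\}$ (0.2) | $\{\theta_1\}$ (0.08) | $\{\theta_2\}$ (0.06) | $\{\theta_1, \theta_2\}$ (0.06)

Table 9.3: The conjunctive and disjunctive combination of evidence (Dubois-Prade)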


Table 9.4: The hybrid DSm combination of evidence

$S_2\ (m_2)$ \ $S_1\ (m_1)$ | $\{\theta_1\}$ (0.4) | $\{\theta_2\}$ (0.3) | $\{\theta_1, \theta_2\}$ (0.3)
$\{\theta_1\}$ (0.5) | $\{\theta_1\}$ (0.2) | $\{\theta_1\} \cup \{\theta_2\}$ (0.15) | $\{\theta_1, \theta_2\}$ (0.15)
$\{\theta_2\}$ (0.3) | $\{\theta_1\} \cup \{\theta_2\}$ (0.12) | $\{\theta_2\}$ (0.09) | $\{\theta_1, \theta_2\}$ (0.09)
$\{\theta_1, \theta_2\}$ (0.2) | $\{\theta_1, \theta_2\}$ (0.08) | $\{\theta_1, \theta_2\}$ (0.06) | $\{\theta_1, \theta_2\}$ (0.06)

Table 9.5: The disjunctive combination of evidence

Property 1: the role of evidence of each source of information in the combination judgment:

1. With DS combination rule of evidence [2], the combined judgment for element $\theta_i$ $(i = 1, 2)$ consists of two parts. One is from the simultaneous support judgment of the two sources of information for the element $\theta_i$ $(i = 1, 2)$, and the other arises when one of the two sources of information yields a support judgment while the second source is ignorant about the element $\theta_i$ $(i = 1, 2)$ (i.e. ignorance). The combined total ignorance is from the total ignorance of both sources of information. The failure combination judgment for some element is from the conflicting judgments given by the two sources of information for the element.


2. The difference between Yager’s combination rule of evidence and DS combination rule of evidence [2] is that the conflicting judgments given by the two sources of information for some element are considered to be a part of the combined ignorance, i.e. they are added to the total ignorance.

3. Dubois and Prade’s combination rule of evidence [26] is different from Yager’s combination rule in that, when the two sources of information give conflicting judgments for an element in the frame of discernment, at least one of the two judgments is regarded as reasonable. The conflicting judgments for the two conflicting elements are distributed to the judgment corresponding to the union of the two conflicting elements.

4. The classical DSm combination rule of evidence is different from those of Dubois and Prade [26], Yager [20] and DS [2]. The conflicting judgments given by the two sources of information for an element in the frame of discernment are considered as paradoxes. These paradoxes finally support the combination judgment of each element $\theta_i$ $(i = 1, 2)$. The hybrid DSm combination rule (see chapter 4) consists of three parts. The first one is from the classic DSm rule of combination based on the free DSm model; the second one is the mass of all relatively and absolutely empty sets, which is transferred to the total or relative ignorance; the third one is the mass that transfers all the relatively empty sets to the union of the elements that are included in the sets.

5. With the disjunctive combination rule of evidence [22, 23, 29, 30, 31], the combination judgment for each element is only from the simultaneous support judgment of each source of information for the element $\theta_i$ $(i = 1, 2)$. The combined ignorance consists of the combination of conflicting judgments given by the two sources of information, the combination of the ignorance given by one source of information and the support judgment for any element given by the other source, and the combination of the ignorance from both sources of information simultaneously. There is no failure combination judgment. However, the combined belief is decreased and the ignorance is increased.

6. The combination rules of evidence of DS and the classical DSm are conjunctive rules, the disjunctive combination rule of evidence is a disjunctive rule, while the combination rules of evidence of Yager, Dubois & Prade, and the hybrid DSm are hybrids of the conjunctive and disjunctive rules.


Property 2: the comparison of combination judgment belief ($Bel(.)$) and ignorance ($Ign(.) = Pl(.) - Bel(.)$) of each combination rule is:

$$Bel_{DS}(\theta_i) > Bel_{DSm}(\theta_i) > Bel_{DP}(\theta_i) > Bel_Y(\theta_i) > Bel_{Dis}(\theta_i), \quad i = 1, 2 \tag{9.82}$$

$$Ign_{DS}(\theta_i) < Ign_{DSm}(\theta_i) < Ign_{DP}(\theta_i) < Ign_Y(\theta_i) < Ign_{Dis}(\theta_i), \quad i = 1, 2 \tag{9.83}$$


In fact, for the above two sources of information, the results from each combination rule are as follows:

Combination rule | $m(\theta_1)$ | $m(\theta_2)$ | $m(\Theta)$ | $Bel(\theta_1)$ | $Bel(\theta_2)$ | $Bel(\Theta)$ | $Ign(\theta_1)$ | $Ign(\theta_2)$
DS | 0.589 | 0.329 | 0.082 | 0.589 | 0.329 | 1 | 0.082 | 0.082
Yager | 0.43 | 0.24 | 0.33 | 0.43 | 0.24 | 1 | 0.33 | 0.33
DP | 0.43 | 0.24 | 0.33 | 0.43 | 0.24 | 1 | 0.33 | 0.33
Hybrid DSm | 0.43 | 0.24 | 0.33 | 0.43 | 0.24 | 1 | 0.33 | 0.33
Disjunctive | 0.20 | 0.09 | 0.71 | 0.20 | 0.09 | 1 | 0.71 | 0.71


From the results in the above table, it can be observed that the hybrid DSm’s, Yager’s and Dubois & Prade’s combination judgments are identical for the two independent sources of information. However, for more than two independent sources of information, the results of the combination judgments are as in equations (9.82) and (9.83) (i.e. the results are different: the hybrid DSm model is more general than Dubois-Prade’s and Yager’s, while Dubois-Prade’s model has less total ignorance than Yager’s).
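The masses compared under property 2 can be recomputed from the two sources $S_1$ and $S_2$ of this section. The sketch below is one possible way to do so: the single function `combine` with a `rule` label is our own arrangement, the hybrid DSm rule is not implemented separately (for these two sources the chapter states that it coincides with Yager’s and Dubois & Prade’s results), and values are rounded to three decimals.

```python
from itertools import product

TH1, TH2 = frozenset({"t1"}), frozenset({"t2"})
THETA = TH1 | TH2

m1 = {TH1: 0.4, TH2: 0.3, THETA: 0.3}  # source S1
m2 = {TH1: 0.5, TH2: 0.3, THETA: 0.2}  # source S2

def combine(m1, m2, rule):
    """rule is one of 'DS', 'Yager', 'DP', 'Dis' (labels chosen for this sketch)."""
    out, conflict = {}, 0.0
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        if rule == "Dis":
            target = a | b              # disjunctive rule (9.81)
        elif a & b:
            target = a & b              # agreeing part, common to DS, Yager and DP
        elif rule == "DP":
            target = a | b              # conflict goes to the union (9.75)
        elif rule == "Yager":
            target = THETA              # conflict goes to total ignorance (9.74)
        else:
            conflict += x * y           # DS: conflict is removed, then normalized
            continue
        out[target] = out.get(target, 0.0) + x * y
    if rule == "DS":
        out = {k: v / (1.0 - conflict) for k, v in out.items()}
    return out

def row(rule):
    """Return (m(theta1), m(theta2), m(Theta)) rounded to three decimals."""
    m = combine(m1, m2, rule)
    return tuple(round(m.get(k, 0.0), 3) for k in (TH1, TH2, THETA))

table = {rule: row(rule) for rule in ("DS", "Yager", "DP", "Dis")}
for rule, values in table.items():
    print(rule, values)
```

For this two-element frame, $Bel(\theta_i) = m(\theta_i)$ and $Ign(\theta_i) = Pl(\theta_i) - Bel(\theta_i) = m(\Theta)$, so the remaining columns of the comparison follow directly from these three masses.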
Property 3: the conflicting judgments given by the two sources of information for the frame of discernment:

Under DS combination rule, the combined conflicting judgments are regarded as failures and are deducted from the total basic probability assignment of the combination, while under Yager’s combination rule, they are regarded as total ignorance; under Dubois & Prade’s combination rule, they are distributed to the union of the two conflicting elements, which means that at least one of the conflicting judgments is reasonable. Under the classical DSm combination rule, they constitute paradoxes that support the combined judgment belief of each element, and they are also regarded as a new event that takes part in the subsequent judgment when new evidence occurs. For the hybrid DSm combination rule, the treatment of conflicting evidence is similar to Dubois & Prade’s approach. For the disjunctive combination rule, the conflicting judgments constitute ignorance, and take part in the subsequent judgment when new evidence occurs.


Property 4: using them in applications:

Based on properties 1-3, when the two independent sources of information are not highly conflicting, the disjunctive combination rule is the more conservative combination rule. The combined results are uncertain when conflicting judgments of the two sources of information occur, and hence the final judgment is delayed until more evidence comes into the judgment system. Also, the combined judgment belief for each element in the frame of discernment is decreased, and ignorance is increased, as new evidence comes in. Hence, the disjunctive combination rule is not efficient when we want the ignorance to be decreased in the combination of evidence. It rests on the fair assumption that, when it is not known exactly which of two conflicting judgments is more reasonable, at least one of them should provide a reasonable judgment. DS combination rule behaves in the opposite way. It can reach the final judgment faster than the other rules (see equations (9.82)-(9.83)), but the disjunctive combination rule will make fewer erroneous judgments than the other rules. The hybrid DSm, Dubois & Prade’s and Yager’s combination rules fall between these two. For the other properties, for instance in the case of two conflicting independent sources of information, see the next section and the example that follows.


9.6.3 Example

In this section, we examine the efficiency of each combination rule for conflicting judgments via Zadeh’s famous example. Let the frame of discernment of a patient be $\Theta = \{M, C, T\}$, where M denotes meningitis, C represents contusion and T indicates tumor. The judgments of two doctors about the patient are

$$m_1(M) = 0.99, \ m_1(T) = 0.01 \quad \text{and} \quad m_2(C) = 0.99, \ m_2(T) = 0.01$$

The results from each combination rule of evidence are:

Rules | $m(T)$ | $m(M \cup C)$ | $m(C \cup T)$ | $m(M \cup T)$ | $m(\Theta)$
DS | 1 | 0 | 0 | 0 | 0
Yager | 0.0001 | 0 | 0 | 0 | 0.9999
DP | 0.0001 | 0.9801 | 0.0099 | 0.0099 | 0
Hybrid DSm | 0.0001 | 0.9801 | 0.0099 | 0.0099 | 0
Disjunctive | 0.0001 | 0.9801 | 0.0099 | 0.0099 | 0

The basic belief masses $m(M \cap C)$, $m(C \cap T)$ and $m(M \cap T)$ equal zero with all five rules of combination, and the beliefs of propositions $M \cap C$, $C \cap T$, $M \cap T$, $M \cup C$, $C \cup T$, $M \cup T$, $M$, $C$, $T$ and $M \cup C \cup T$ are given in the next tables:





Comparison and analysis of the fusion results:

1. DS combination judgment belief of each element is:

$$Bel_{DS}(T) = 1, \quad Bel_{DS}(M) = Bel_{DS}(C) = 0$$

It means that the patient must have disease T with a degree of belief of 1 and must not have diseases M and C, because their degrees of belief are 0. This is a counter-intuitive situation, given that $Bel_{DS,1}(M) = Bel_{DS,2}(C) = 0.99$ and $Bel_{DS,1}(T) = Bel_{DS,2}(T) = 0.01$. Moreover, whatever the basic probability assignment values over diseases T, M and C, the combined judgment of the two doctors under DS combination rule will always be T with a degree of belief of 1, and M and C each with a degree of belief of 0. It shows DS combination rule is not effective in this case. The main reason for this situation has been presented in sections 9.3-9.5.

2. Yager’s combination judgment belief of each element is:

$$Bel_Y(T) = 0.0001, \quad Bel_Y(M) = Bel_Y(C) = 0$$

This degree of belief is too small to make the final judgment. Therefore, Yager’s combination rule of evidence will wait for new evidence to come in order to obtain a more accurate judgment. The reason for this result is that the rule transforms all conflicting judgments into total ignorance.

3. For Dubois & Prade’s combination rule, there is

$$Bel_{DP}(T) = 0.0001, \quad Bel_{DP}(M \cup C) = 0.9801, \quad Bel_{DP}(M \cup T) = Bel_{DP}(C \cup T) = 0.01$$

This result is the same as that of the disjunctive combination rule and the hybrid DSm combination rule. With a belief in T of $Bel_{DP}(T) = 0.0001$, we can judge that the patient having disease T is an improbable event. Furthermore, $Bel_{DP}(M \cup T) = Bel_{DP}(C \cup T) = 0.01$, hence the patient may have disease M or C. Also, $Bel_{DP}(M \cup C) = 0.9801$, which further substantiates the fact that the patient has either M or C, or both. For the final judgment, one needs new evidence or a diagnosis by a third doctor.


Based on the judgments of the two doctors, the different judgment results of each combination rule are clearly demonstrated. For this case, the results from Dubois & Prade’s rule, the hybrid DSm rule and the disjunctive combination rule are closer to human intuitive judgment; Yager’s combination rule cannot make the final judgment immediately because of its lower degree of judgment belief and greater ignorance, while the result of DS combination rule is counter-intuitive. These results demonstrate the efficiency of each combination rule for conflicting judgments given by two sources of information for an element in the frame of discernment.
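The remark in item 1, that DS combination rule returns T with belief 1 whatever the doctors’ masses are, can be illustrated with a short sketch. The function names and the three values of the small mass `eps` are our own choices; the normalization divides by the mass that survives the intersections, which is equivalent to dividing by one minus the conflict.

```python
from itertools import product

def dempster(m1, m2):
    """Normalized conjunctive rule: keep non-empty intersections, then renormalize."""
    joint = {}
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        c = a & b
        if c:
            joint[c] = joint.get(c, 0.0) + x * y
    kept = sum(joint.values())          # equals 1 minus the conflicting mass
    if kept == 0.0:
        raise ValueError("total conflict: the rule is undefined")
    return {c: v / kept for c, v in joint.items()}

M, C, T = (frozenset({name}) for name in "MCT")

def doctors(eps):
    """Zadeh-style judgments: each doctor gives only the mass eps to T."""
    return {M: 1.0 - eps, T: eps}, {C: 1.0 - eps, T: eps}

results = {eps: dempster(*doctors(eps)) for eps in (0.01, 0.3, 1e-6)}
for eps, combined in results.items():
    print(eps, {"".join(sorted(k)): v for k, v in combined.items()})
```

Since T is the only focal element the two doctors share, it is the only one left after the intersections, and normalization then scales its mass to 1 regardless of how small `eps` is.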


9.7 Conclusion 


In this chapter, DS combination rule is examined based on multi-valued mappings of independent information sources and the product combination rule of multiple independent information sources. It is found that Dempster’s rule is different from DS combination rule and that the shortcomings of DS combination rule are due to the product combination rule. The drawback in the explanation of multi-valued mappings when applied to Dempster’s rule was pointed out and proven. Furthermore, based on these results, a novel justification of the disjunctive combination rule for two independent sources of information has been proposed, based on the redefined combination-mapping rule of multiple multi-valued mappings in the product space of multiple sources of information mappings. The combination rule depends on the logical judgment of OR. It overcomes the shortcomings of Dempster-Shafer’s combination rule, especially in resolving the counter-intuitive situation. Finally, the conjunctive and disjunctive combination rules of evidence, namely Dempster-Shafer’s (DS) combination rule, Yager’s combination rule, Dubois & Prade’s (DP) combination rule, DSm’s combination rule and the disjunctive combination rule, are studied for two independent sources of information. The properties of each combination rule of evidence are discussed in detail, such as the role of the evidence of each source of information in the combination judgment, the comparison of the combination judgment belief and ignorance of each combination rule, the treatment of conflicting judgments given by the two sources of information, and the applications of the combination rules. The new results yield valuable theoretical insight into the rules that can be applied to a given situation. Zadeh’s typical example is included in this chapter to evaluate the performance as well as the efficiency of each combination rule of evidence for conflicting judgments given by the two sources of information.

Chapter 10


Comparison between DSm and MinC combination rules


Milan Daniel 
Institute of Computer Science 
Academy of Sciences of the Czech Republic 
Pod vodárenskou věží 2, CZ - 182 07 Prague 8 
Czech Republic 


Abstract: Both DSm and minC rules of combination endeavor to process conflicts among combined beliefs better. The nature of conflicts as well as their processing during the belief combination is sketched. A presentation of the minC combination, an alternative to Dempster’s rule of combination, follows. Working domains, structures and mechanisms of the DSm and minC combination rules are compared in the body of this chapter. Finally, some comparative examples are presented.


10.1 Introduction 


The classical DSm rule of combination, originally presented in [5, 6], has served for combination of two or several beliefs on the free DSm model. Later, a hybrid DSm combination rule has been developed to be applicable also on the classical Shafer (or Dempster-Shafer, DS) and the hybrid DSm
model. The present state of the DSm rule is described in Chapter f] see Equations (7-10). 


Partial support by the COST action 274 TARSKI is acknowledged. 
The minC combination (minimal conflict/minimal contradiction) rule introduced in [2, 4] is an alternative to Dempster’s rule of combination on the classical DS model. This rule has been developed for better handling of conflicting situations, which is a weak point of the classical Dempster rule. A brief description of the idea of the minC combination is presented in Section 10.3.


Both arguments and results of the DSm rule are beliefs in a DSm model, which admits intersections of elements of the frame of discernment in general. The minC combination serves for combination of classical belief functions (BFs), where all intersections of elements (of the frame of discernment) are empty and their resulting basic belief masses should be 0.

For finer processing of conflicts than the classical normalization in Dempster’s rule, a system of different types of conflict (or empty set) is introduced. Intermediate results are represented by generalized BFs on generalized frames of discernment, which contain the elements of the classical DS frame of discernment and the corresponding types of conflict.


Even though the two approaches and their paradigms were originally different (disjoint), the intermediate working generalized beliefs of the minC combination are similar to those in the free DSm model, and the way of combination on the generalized level is analogous to that in the free DSm model. This surprising fact is the main reason why we compare these two seemingly incomparable and originally quite disjoint approaches.


Now, after the development of the DSm combination for any hybrid DSm model, it is, moreover, possible to compare the behavior of both approaches on classical BFs, i.e. in the application domain of the minC combination.


10.2 Conflict in belief combination 


In the DSm combination, which is specially designed for conflicting situations, there are no problems with conflicts.


The common principle of Dempster's rule, the minC combination and the DSm combination rule is that the basic belief assignment/mass (bbm) $m_1(X)$, assigned to set $X$ by the first basic belief assignment (bba) $m_1$, multiplied by bbm $m_2(Y)$, assigned to set $Y$ by the second bba $m_2$, is assigned to the set $X \cap Y$ by the resulting bba $m_{12}$, i.e. $m_1(X)m_2(Y)$ is a part of $m_{12}(X \cap Y)$.


This principle works relatively nicely if sets $X$ and $Y$ are not disjoint. There is also no problem for the DSm rule, because $X \cap Y$ is always an element of $D^\Theta$ and a positive value on it is accepted even in the case of sets $X$ and $Y$ without any common element of $\Theta$.


In Dempster's rule, disjoint $X$ and $Y$ lead to a conflict situation. All the conflicts are summed up together and reallocated onto $2^\Theta$ by normalization in the classical normalized Dempster's rule, see [9], or stored as $m(\emptyset)$ in the non-normalized Dempster's rule in the Transferable Belief Model (TBM) by Smets, see [10] [11]. It is a fact that in Smets' approach the normalization is only postponed from the combination phase to the decisional one, as the normalization is the first step of computation of the classical pignistic transformation in TBM. The non-normalized Dempster rule commutes with the normalization, hence the pignistic probability is always the same in both the cases of normalized and non-normalized Dempster's rule.
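The relation between the two versions of Dempster's rule can be illustrated by a minimal Python sketch. The function names, the 2-element frame and the input masses below are illustrative assumptions, not taken from the chapter; the sketch only shows that normalizing during combination or during the pignistic transformation leads to the same pignistic probability.

```python
from itertools import product

def conjunctive(m1, m2):
    """Non-normalized conjunctive rule: m1(X)*m2(Y) goes to X & Y (possibly empty)."""
    out = {}
    for (x, mx), (y, my) in product(m1.items(), m2.items()):
        z = x & y
        out[z] = out.get(z, 0.0) + mx * my
    return out

def normalize(m):
    """Dempster normalization: drop m(emptyset) and rescale the remaining masses."""
    k = m.get(frozenset(), 0.0)
    return {x: v / (1.0 - k) for x, v in m.items() if x}

def pignistic(m):
    """Classical pignistic transformation; normalization is its first step."""
    bet = {}
    for x, v in normalize(m).items():
        for t in x:
            bet[t] = bet.get(t, 0.0) + v / len(x)
    return bet

# Hypothetical input bbas on the frame {a, b}
A, B, AB = frozenset("a"), frozenset("b"), frozenset("ab")
m1 = {A: 0.6, B: 0.1, AB: 0.3}
m2 = {A: 0.2, B: 0.5, AB: 0.3}

tbm = conjunctive(m1, m2)      # conflict is kept as m(emptyset)
dempster = normalize(tbm)      # normalized Dempster's rule
bet_tbm = pignistic(tbm)
bet_dempster = pignistic(dempster)
```

In this sketch the conflict mass `tbm[frozenset()]` is the sum of the two products assigned to disjoint singletons, and `bet_tbm` agrees with `bet_dempster` up to floating-point rounding.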


A weak point of Dempster's rule, the combination of conflicting beliefs, is caused by normalization or by grouping all the conflicts together in the non-normalized version of Dempster's rule. Therefore, different types of conflict were introduced and the minC combination rule has been developed for a better handling of conflicting situations.


10.3 The minC combination 


The minC combination (the minimal contradiction/conflict combination) of belief functions was developed [2] [4] with an effort to find a new associative combination which processes conflicts better than Dempster's rule. The classical Shafer model from Dempster-Shafer theory is supposed for both input and resulting belief functions. The minC combination is a generalization¹ of the un-normalized Dempster's rule. $m(\emptyset)$ is not considered as an argument for new unknown elements of the frame of discernment; $m(\emptyset)$ is considered as a conflict² arising by conjunctive combination. To handle it, a system of different types of conflicts is considered with respect to the sets which produce the conflicts.


10.3.1 A system of different types of conflicts 


We distinguish conflicts according to the sets to which the original bbms were assigned by $m_i$. There is only one type of conflict among the belief functions defined on a binary frame of discernment, hence the minC combination coincides with the non-normalized conjunctive rule there.


¹ Note that, on the other hand, the minC combination approach is a special case of an even more general approach of combination of belief functions "per elements", see [3].
² The term "contradiction" is used in [2] [4], while we use "conflict" here in order to have a uniform terminology.




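In the case of an n-ary frame of discernment we distinguish different types of conflicts, e.g. $\{\theta_1\}\times\{\theta_2\}$, $\{\theta_1\}\times\{\theta_2,\theta_3\}$, $\{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}$, $\{\theta_i,\theta_j,\theta_k\}\times\{\theta_m,\theta_n,\theta_o\}$, etc. The symbol $\times$ serves here only as a notation for conflicts; it is not used as any new operation on sets. Thus e.g. $\{\theta_1\}\times\{\theta_2,\theta_3\}$ simply denotes the conflict between sets $\{\theta_1\}$ and $\{\theta_2,\theta_3\}$.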


We assume that products of the conflicting bbms are temporarily assigned (always keeping in mind that Shafer's constraints should be satisfied) to the corresponding conflicts: e.g. $m_1(\{\theta_1\})m_2(\{\theta_2\})$ is assigned to the conflict $\{\theta_1\}\times\{\theta_2\}$. In this way we obtain so called generalized bbas, and generalized BFs on a generalized frame of discernment given by $\Theta$.


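When combining 2 BFs defined on a 3D frame $\Theta = \{\theta_1,\theta_2,\theta_3\}$ we obtain the following conflicts as intersections of disjoint subsets of $\Theta$: $\{\theta_1\}\times\{\theta_2\}$, $\{\theta_1\}\times\{\theta_3\}$, $\{\theta_2\}\times\{\theta_3\}$, $\{\theta_1,\theta_2\}\times\{\theta_3\}$, $\{\theta_1,\theta_3\}\times\{\theta_2\}$, and $\{\theta_2,\theta_3\}\times\{\theta_1\}$.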


Because we need a classical BF as a result of the combination, we have to reallocate bbms assigned to conflicts among subsets of $\Theta$ after the combination. These bbms are proportionalized, i.e. proportionally distributed, among the subsets of $\Theta$ corresponding to the conflicts. A few such proportionalizations are presented in [4]. Unfortunately, all these proportionalizations break the required associativity of the conjunctive combination. To keep the associativity as long as possible we must be able to combine the generalized belief functions with other BFs and generalized BFs. For this reason other conflicts arise, e.g. $\{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}$, $(\{\theta_1,\theta_2\}\times\{\theta_1,\theta_3\})\times\{\theta_2\}\times\{\theta_3\}$, $(\{\theta_1,\theta_2\}\times\{\theta_3\})\times(\{\theta_2\}\times\{\theta_3\})$, etc.


A very important role for keeping associativity is played by so called partial or potential conflicts³, e.g. a partial conflict $\{\theta_1,\theta_2\}\times\{\theta_2,\theta_3\}$, which is not a conflict in the case of combination of two beliefs ($\{\theta_1,\theta_2\}\cap\{\theta_2,\theta_3\} = \{\theta_2\}$), but it can cause a conflict in a later combination with another belief, e.g. the pure or real conflict⁴ $\{\theta_1,\theta_2\}\times\{\theta_2,\theta_3\}\times\{\theta_1,\theta_3\}$, because $\{\theta_1,\theta_2\}\cap\{\theta_2,\theta_3\}\cap\{\theta_1,\theta_3\} = \emptyset$ in Shafer's model.


In order not to have an infinite number of different conflicts, the conflicts are divided into classes of equivalence $\sim$ which are called types of conflicts, e.g. $\{\theta_1\}\times\{\theta_2\} \sim \{\theta_2\}\times\{\theta_1\} \sim \{\theta_1\}\times\{\theta_2\}\times\{\theta_2\}\times\{\theta_2\}\times\{\theta_1\}\times\{\theta_1\}\times\{\theta_1\}$, etc. The minC combination works with these equivalence classes (types of conflict) instead of the set of all different conflicts. For more details see [4].


³ Potential contradictions in the original terminology of [2] [4].
⁴ A real contradiction in [2] [4].
The conflicts are considered "per elements" in the following way: conflict $\{\theta_1,\theta_2\}\times\{\theta_3\}$ is considered as a set of elementary conflicts $\{\{\theta_1\}\times\{\theta_3\}, \{\theta_2\}\times\{\theta_3\}\}$, i.e. a set of conflicts between/among singletons. Analogically, potential conflict $\{\theta_1,\theta_2\}\times\{\theta_2,\theta_3\}$ is considered as a set of elementary conflicts $\{\{\theta_1\}\times\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}, \{\theta_2\}, \{\theta_2\}\times\{\theta_3\}\}$, where $\{\theta_2\} \sim \{\theta_2\}\times\{\theta_2\}$ is a so called trivial conflict⁵, i.e. no conflict in fact. Note that any partial conflict contains at least one trivial conflict. The set of elementary conflicts is constructed similarly to the Cartesian product of conflicting sets, where $\{\theta_1\}\times\{\theta_2\}\times\ldots\times\{\theta_n\}$ is used instead of the n-tuple $[\theta_1,\theta_2,\ldots,\theta_n]$. As the above equivalence $\sim$ of elementary conflicts is used, we have elementary conflicts of different arity in the same set, thus we do not use n-tuples as is usual in the Cartesian product. The idea of "conflicts per elements" was generalized also for non-conflicting sets in the "combination per elements", see [3].


To further decrease the number of types of conflicts we consider only minimal conflicts in the following sense: $\{\theta_1\}\times\{\theta_2\}$ and $\{\theta_3\}$ are the minimal conflicts of the set $\{\{\theta_1\}\times\{\theta_2\}, \{\theta_3\}, \{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}, \{\theta_1\}\times\{\theta_2\}\times\{\theta_4\}\times\{\theta_5\}, \{\theta_1\}\times\{\theta_3\}\times\{\theta_5\}\}$; i.e. the set of singletons contained in a minimal conflict is minimal from the point of view of inclusion among all sets of singletons corresponding to elementary conflicts. Thus $\{\{\theta_1\}\times\{\theta_2\}, \{\theta_3\}\} \sim \{\{\theta_1\}\times\{\theta_2\}, \{\theta_3\}, \{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}, \{\theta_1\}\times\{\theta_2\}\times\{\theta_4\}\times\{\theta_5\}, \{\theta_1\}\times\{\theta_3\}\times\{\theta_5\}\}$. Concentrating only on minimal conflicts brings a simplification which is closer to Shafer's model, and it has no influence on associativity of combination.
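The construction of elementary conflicts and the selection of minimal conflicts can be sketched in a few lines of Python. This is only an illustration under stated assumptions: singletons are encoded as integers, an elementary conflict is encoded as a `frozenset` of singletons (which builds in the equivalence $\sim$, since order and repetition are ignored), and the function names are hypothetical.

```python
from itertools import product

def elementary_conflicts(*sets):
    """Elementary conflicts of X1 x ... x Xk: one per choice of a singleton from each set.

    A frozenset of singletons is used, so {2} x {2} collapses to the
    trivial conflict {2} and {1} x {2} equals {2} x {1}.
    """
    return {frozenset(choice) for choice in product(*sets)}

def minimal_conflicts(conflicts):
    """Keep the conflicts whose singleton set has no proper subset in the collection."""
    return {c for c in conflicts if not any(d < c for d in conflicts)}

# Potential conflict {theta1, theta2} x {theta2, theta3}
elem = elementary_conflicts({1, 2}, {2, 3})
mini = minimal_conflicts(elem)

# The 5-element example from the text
example = {frozenset(s) for s in ({1, 2}, {3}, {1, 2, 3}, {1, 2, 4, 5}, {1, 3, 5})}
example_min = minimal_conflicts(example)
```

For the potential conflict above, `mini` contains the trivial conflict `{2}` and the elementary conflict `{1, 3}`, and `example_min` keeps only `{1, 2}` and `{3}`.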


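In this way we obtain 8 types of conflicts ($\{\theta_1\}\times\{\theta_2\}$, $\{\theta_1\}\times\{\theta_3\}$, $\{\theta_2\}\times\{\theta_3\}$, $\{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}$, $\{\{\theta_1\}\times\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}\}$, $\{\{\theta_1\}\times\{\theta_2\}, \{\theta_2\}\times\{\theta_3\}\}$, $\{\{\theta_1\}\times\{\theta_3\}, \{\theta_2\}\times\{\theta_3\}\}$, $\{\{\theta_1\}\times\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}, \{\theta_2\}\times\{\theta_3\}\}$) and 3 types of potential conflicts ($\{\{\theta_1\}, \{\theta_2\}\times\{\theta_3\}\}$, $\{\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}\}$, $\{\{\theta_3\}, \{\theta_1\}\times\{\theta_2\}\}$) in the 3D case $\Theta = \{\theta_1,\theta_2,\theta_3\}$. Together with the 7 non-conflicting subsets of $\Theta$ we have 18 sets of conflicts to which nonnegative bbms can be assigned in the 3D case, i.e. 18 elements of a generalized 3D frame of discernment.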
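The count 7 + 8 + 3 = 18 can be cross-checked with a small Python sketch. It relies on an assumption that is not part of the minC definition itself: each $\theta_i$ is modelled as a set of parts of a Venn diagram with all overlaps non-empty, conflicts are modelled as intersections, and the generalized frame is obtained as the closure of $\{\theta_1,\theta_2,\theta_3\}$ under union and intersection. All names are illustrative.

```python
from itertools import combinations

THETA = (1, 2, 3)
# One Venn-diagram part per non-empty index set (7 parts for 3 elements)
REGIONS = [frozenset(c) for r in range(1, 4) for c in combinations(THETA, r)]

def theta(i):
    """theta_i as the set of Venn parts lying inside it."""
    return frozenset(r for r in REGIONS if i in r)

def generalized_frame():
    """Closure of {theta_1, theta_2, theta_3} under union and intersection."""
    elems = {theta(i) for i in THETA}
    while True:
        new = {a | b for a in elems for b in elems} | {a & b for a in elems for b in elems}
        if new <= elems:
            return elems
        elems |= new

def kind(e):
    """Classify an element as classical subset, potential conflict or (pure) conflict."""
    inside = [theta(i) for i in THETA if theta(i) <= e]
    if not inside:
        return "conflict"
    return "subset" if frozenset().union(*inside) == e else "potential conflict"

frame = generalized_frame()
counts = {}
for e in frame:
    counts[kind(e)] = counts.get(kind(e), 0) + 1
```

Under this encoding an element counts as a classical subset when it is exactly a union of whole $\theta_i$, as a potential conflict when it contains some whole $\theta_i$ plus a conflicting part, and as a pure conflict otherwise.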


10.3.2 Combination on generalized frames of discernment 


As the minC combination has the nature of a conjunctive rule of combination, $m_1(X)m_2(Y)$ is assigned to $X \cap Y$, if it is non-empty, or to $X \times Y$ otherwise. More precisely, the least representative of the type of conflict of $X \times Y$ is considered instead of $X \times Y$. It is unique up to the order of elementary conflicts and the order of elements inside elementary conflicts. Fixing these orders enables a unique selection of representatives of the $\sim$ classes of conflicts. A complete 18×18 table of the minC combination for 3D is presented in [2] [4]. We include here only an illustrative part of it, see Table 10.1. The resulting value $m^0(Z)$ of the generalized bba is computed as a sum of all $m_1(X)m_2(Y)$ for which the field of the complete table in the row corresponding to $X$ and the column corresponding to $Y$ contains $Z$. In other words, the generalized $m^0(Z)$ is computed as a sum of all $m_1(X)m_2(Y)$ for which $Z = X \cap Y$ if $(X \subseteq Y) \vee (Y \subseteq X)$, or $Z \sim X \times Y$ otherwise, where $\sim$ is the equivalence of conflicts from the previous subsection ($Z$ and $X \times Y$ are in the same $\sim$ class of conflicts); i.e.


$$m^0(Z) = \sum_{\substack{Z = X \cap Y \\ X \subseteq Y \,\vee\, Y \subseteq X}} m_1(X)\,m_2(Y) \;+ \sum_{\substack{Z \sim X \times Y \\ X \not\subseteq Y \,\&\, Y \not\subseteq X}} m_1(X)\,m_2(Y). \qquad (10.1)$$


⁵ A trivial contradiction.


In order to decrease the size of the table below, the following abbreviations are used in this table: $A$ stands for $\{A\}$, similarly $AB$ stands for $\{A,B\}$, and $ABC$ stands for $\{A,B,C\}$; $A \times B$ stands for $\{A\}\times\{B\}$, similarly $A \times BC$ stands for $\{A\}\times\{B,C\}$; $\times$ stands for $\{A\}\times\{B\}\times\{C\}$; $\Box A$ stands for the potential conflict $\{\{A\}, \{B\}\times\{C\}\}$; and $\Box$ stands for $\{A,B\}\times\{A,C\}\times\{B,C\}$, and similarly.


Table 10.1: A partial table of combination of 2 generalized BFs on $\Theta = \{A, B, C\}$.























The minC combination is commutative and associative on generalized BFs. It overcomes some disadvantages of both versions of Dempster's rule (normalized and un-normalized). This theoretically nice combining rule has, however, a computational complexity that increases rapidly with the size of the frame of discernment.


10.3.3 Reallocation of belief masses of conflicts


Because belief masses are assigned also to types of conflicts and partial conflicts, the result of the minC combination is a generalized belief function even if it is applied to classical BFs. To obtain a classical belief function on Shafer's model we have to perform the following two steps: we first reassign the bbms of partial conflicts to their non-contradictive elements and then we proportionalize the bbms of pure (real) conflicts. Because of the different nature of pure and partial conflicts, these two steps of bbm reallocation are also different.


10.3.3.1 Reallocation of gbbms of partial conflicts 


Gbbms of partial conflicts (potential contradictions) are simply reassigned to the sets of their trivial conflicts, i.e. to the sets of their non-contradictive elements (e.g. $m^0(\{\theta_i,\theta_j\}\times\{\theta_i,\theta_k\})$ is reallocated to $\{\theta_i\}$). We denote the resulting gbba of this step by $m^1$ to distinguish it from the gbba $m^0$ on the completely generalized level. Thus we obtain $m^1(\{\theta_i,\theta_j\}\times\{\theta_i,\theta_k\}) = 0$, and $m^1(\{\theta_i\})$ is a sum of all $m^0(X)$ where $\{\theta_i\}$ is the maximal non-conflicting part of $X$. Nothing is performed with gbbms of pure conflicts in this step, hence $m^1(Y) = m^0(Y)$ for any pure conflict $Y$.
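The step from $m^0$ to $m^1$ can be sketched as follows. The sketch assumes the same Venn-part encoding of generalized elements as an illustration device (each $\theta_i$ is a set of Venn-diagram parts, conflicts are intersections); the function `reallocate_partial` and the input masses are hypothetical. The maximal non-conflicting part of an element is taken to be the union of all $\theta_i$ wholly contained in it.

```python
from itertools import combinations

THETA = (1, 2, 3)
# One Venn-diagram part per non-empty index set
REGIONS = [frozenset(c) for r in range(1, 4) for c in combinations(THETA, r)]

def theta(i):
    """theta_i as the set of Venn parts lying inside it."""
    return frozenset(r for r in REGIONS if i in r)

def reallocate_partial(m0):
    """Step m0 -> m1: move gbbms of potential conflicts to their non-conflicting part."""
    m1 = {}
    for elem, mass in m0.items():
        inside = [theta(i) for i in THETA if theta(i) <= elem]
        # Pure conflicts contain no whole theta_i and keep their gbbm unchanged
        target = frozenset().union(*inside) if inside else elem
        m1[target] = m1.get(target, 0.0) + mass
    return m1

t1, t2, t3 = theta(1), theta(2), theta(3)
# Hypothetical generalized bba: theta1, the potential conflict theta1 U (theta2 n theta3),
# and the pure conflict theta1 n theta2
m0 = {t1: 0.3, t1 | (t2 & t3): 0.2, t1 & t2: 0.5}
m1 = reallocate_partial(m0)
```

Here the gbbm 0.2 of the potential conflict is added to the mass of $\theta_1$, while the gbbm of the pure conflict $\theta_1 \cap \theta_2$ stays where it is and waits for proportionalization.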


10.3.3.2 Proportionalization of gbbms of pure conflicts 


Let us present two ways to accomplish a proportionalization of gbbms which have been assigned by $m^1$ to pure (real) conflicts. The basic belief mass of a conflict $X \times Y$ between two subsets of $\Theta$ can be proportionalized, i.e. reallocated according to the proportions of the corresponding non-conflicting bbms:
a) among $X$, $Y$, and $X \cup Y$, as originally designed for the so called proportionalized combination rule in [1].
b) among all nonempty subsets of $X \cup Y$. This way combines the original idea of proportionalization with the consideration of conflict "per elements".


For a conflict $X$ of several subsets $X_1, X_2, \ldots, X_k \subseteq \Theta$ of a frame of discernment, e.g. for $\{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}$ and for $\{\{\theta_1\}\times\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}, \{\theta_2\}\times\{\theta_3\}\} \sim \{\theta_1,\theta_2\}\times\{\theta_1,\theta_3\}\times\{\theta_2,\theta_3\}$ in 3D, and for further conflicts from the nD case, we have to generalize the above description of proportionalization in the following way.


The bbm of contradiction $X = X_1 \times X_2 \times \ldots \times X_k$ can be proportionalized:
a) among all unions $\bigcup_{i=1}^{j} X_i$ of $j \le k$ sets $X_i$ from $\{X_1, X_2, \ldots, X_k\}$.
b) among all nonempty subsets of $X_1 \cup X_2 \cup \ldots \cup X_k$.


For an explicit list of the sets among which the conflicts of the subsets of the 3D frame $\Theta = \{\theta_1,\theta_2,\theta_3\}$ should be proportionalized, see Table 10.2. The bbms of conflicts in the first column should be proportionalized by the proportionalization ad a) among the sets in the second column and by the proportionalization ad b) among the sets in the third column.
If the gbbms $m^1(X_i) = 0$ for all $X_i$, then we divide the proportionalized gbbm $m^1(X_1 \times X_2 \times \ldots \times X_k)$ by the number of sets among which the gbbm should be proportionalized, i.e. by $2^k - 1$ in the proportionalization a) and by $2^m - 1$, where $m = |X_1 \cup X_2 \cup \ldots \cup X_k|$, in the case b).










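Type of conflict | Proportionalization ad a) | Proportionalization ad b)
$\{\theta_1\}\times\{\theta_2\}$ | $\{\theta_1\}, \{\theta_2\}, \{\theta_1,\theta_2\}$ | $\{\theta_1\}, \{\theta_2\}, \{\theta_1,\theta_2\}$
$\{\theta_1\}\times\{\theta_2,\theta_3\}$ | $\{\theta_1\}, \{\theta_2,\theta_3\}, \{\theta_1,\theta_2,\theta_3\}$ | $\mathcal{P}(\{\theta_1,\theta_2,\theta_3\}) - \emptyset$
$\{\theta_1,\theta_2\}\times\{\theta_1,\theta_3\}\times\{\theta_2,\theta_3\}$ | $\{\theta_1,\theta_2\}, \{\theta_1,\theta_3\}, \{\theta_2,\theta_3\}, \{\theta_1,\theta_2,\theta_3\}$ | $\mathcal{P}(\{\theta_1,\theta_2,\theta_3\}) - \emptyset$
$\{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}$ | $\mathcal{P}(\{\theta_1,\theta_2,\theta_3\}) - \emptyset$ | $\mathcal{P}(\{\theta_1,\theta_2,\theta_3\}) - \emptyset$

Table 10.2: Proportionalizations on a 3D frame of discernment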


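A proportionalization of the types of conflicts from the Table is the same even if $\{\theta_1,\theta_2,\theta_3\} \subsetneq \Theta$. Hence we can see from the Table that the proportionalization is something like a "local normalization" on the power set of $\Theta' \subsetneq \Theta$ in the case b), or on a subset of such a power set. E.g. $m^1(\{\theta_1\}\times\{\theta_2,\theta_3\})$ is proportionalized with proportionalization a) among $\{\theta_1\}$, $\{\theta_2,\theta_3\}$, $\{\theta_1,\theta_2,\theta_3\}$ so that

$$\frac{m^1(\{\theta_1\})}{m^1(\{\theta_1\}) + m^1(\{\theta_2,\theta_3\}) + m^1(\{\theta_1,\theta_2,\theta_3\})}\; m^1(\{\theta_1\}\times\{\theta_2,\theta_3\})$$

is assigned to $\{\theta_1\}$,

$$\frac{m^1(\{\theta_2,\theta_3\})}{m^1(\{\theta_1\}) + m^1(\{\theta_2,\theta_3\}) + m^1(\{\theta_1,\theta_2,\theta_3\})}\; m^1(\{\theta_1\}\times\{\theta_2,\theta_3\})$$

is assigned to $\{\theta_2,\theta_3\}$, and

$$\frac{m^1(\{\theta_1,\theta_2,\theta_3\})}{m^1(\{\theta_1\}) + m^1(\{\theta_2,\theta_3\}) + m^1(\{\theta_1,\theta_2,\theta_3\})}\; m^1(\{\theta_1\}\times\{\theta_2,\theta_3\})$$

is assigned to $\{\theta_1,\theta_2,\theta_3\}$. Analogically

$$\frac{m^1(\{\theta_2,\theta_3\})}{\sum_{\emptyset \neq X \subseteq \{\theta_1,\theta_2,\theta_3\}} m^1(X)}\; m^1(\{\theta_1\}\times\{\theta_2,\theta_3\})$$

is assigned to $\{\theta_2,\theta_3\}$ with proportionalization b), and similarly for the other subsets of $\{\theta_1,\theta_2,\theta_3\}$. For single elementary conflicts both the proportionalizations coincide, see e.g. the 1st and the 4th rows of Table 10.2. In particular, there is only one proportionalization in the 2D case, because there is only one conflict and it is an elementary one. This proportionalization actually coincides with the classical normalization, see examples in Section 10.5.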
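The proportionalization of a single conflict can be sketched in Python as follows. The function `proportionalize`, the encoding of subsets as `frozenset`s of indices and the numerical masses are assumptions made only for this illustration; the equal split in the all-zero case follows the rule stated before Table 10.2.

```python
from itertools import combinations

def proportionalize(conflict_mass, targets, m1):
    """Split the gbbm of one pure conflict among `targets` in proportion to m1."""
    weights = [m1.get(t, 0.0) for t in targets]
    total = sum(weights)
    if total == 0:
        # All non-conflicting gbbms are 0: divide equally among the target sets
        return {t: conflict_mass / len(targets) for t in targets}
    return {t: conflict_mass * w / total for t, w in zip(targets, weights)}

def nonempty_subsets(s):
    """All non-empty subsets of s, used for proportionalization b)."""
    items = sorted(s)
    return [frozenset(c) for r in range(1, len(items) + 1) for c in combinations(items, r)]

# Conflict {theta1} x {theta2, theta3} with a hypothetical gbba m1
X, Y = frozenset({1}), frozenset({2, 3})
m1 = {X: 0.3, Y: 0.1, X | Y: 0.1, frozenset({2}): 0.3}
conflict_mass = 0.2

shares_a = proportionalize(conflict_mass, [X, Y, X | Y], m1)            # way a)
shares_b = proportionalize(conflict_mass, nonempty_subsets(X | Y), m1)  # way b)
```

With these numbers way a) splits the conflict mass 0.2 in the ratio 0.3 : 0.1 : 0.1, whereas way b) also gives a share to $\{\theta_2\}$, because it is a non-empty subset of $X \cup Y$ with a positive gbbm.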


Let us remember that neither the reallocation of gbbms of partial conflicts nor the proportionalization keeps the associativity that the minC combination has on the generalized level. Hence we always have to keep and save the generalized version of the result, to be prepared for a later combination with another belief.


10.3.4 Summary of the idea of the minC combination 
We can summarize the process of the minC combination of $n \ge 2$ beliefs as follows:
1. we apply $(n-1)$ times the generalized version of minC to compute the gbba $m^0$, see formula (10.1);
2. afterwards we once apply a reallocation of gbbms of the partial conflicts to produce the gbba $m^1$, and finally we once apply the proportionalization a) or b) to obtain the final bba $m$. If we want to keep as much as possible of associativity for future combining, we have to remember also the gbba $m^0$ and continue further combination (if there is any) from it.


10.4 Comparison 


10.4.1 Comparison of generalized frames of discernment 


As has already been mentioned in the introduction of this chapter, the DSm and minC rules of combination arise from completely different assumptions and ideas. On the other hand, the 18 different subsets of a frame of discernment and types of conflicts and potential conflicts (7+8+3) in the 3D case, i.e. the 18 elements of a generalized 3D frame of discernment, correspond to the 18 non-empty elements of the hyper-power set $D^\Theta$ in the free DSm model. Moreover, if we rewrite subsets of the frame of discernment, e.g. $\{\theta_i,\theta_j,\theta_k\}$, and sets of elementary conflicts as unions of their elements, e.g. $\{\theta_i,\theta_j,\theta_k\} \sim \theta_i \cup \theta_j \cup \theta_k$, and conflicts as intersections, e.g. $\{\theta_i\}\times\{\theta_j\} \sim \theta_i \cap \theta_j$, $\{\theta_i,\theta_j\}\times\{\theta_i,\theta_k\} \sim (\theta_i \cup \theta_j) \cap (\theta_i \cup \theta_k)$, $\{\{\theta_i\}\times\{\theta_j\}, \{\theta_i\}\times\{\theta_k\}, \{\theta_i\}\times\{\theta_j\}\times\{\theta_k\}\} \sim (\theta_i \cap \theta_j) \cup (\theta_i \cap \theta_k) \cup (\theta_i \cap \theta_j \cap \theta_k)$, then we obtain the following:

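$$\begin{aligned}
\{\theta_1\} &\sim \theta_1 = \alpha_9 \\
\{\theta_2\} &\sim \theta_2 = \alpha_{10} \\
\{\theta_3\} &\sim \theta_3 = \alpha_{11} \\
\{\theta_1,\theta_2\} &\sim \theta_1 \cup \theta_2 = \alpha_{15} \\
\{\theta_1,\theta_3\} &\sim \theta_1 \cup \theta_3 = \alpha_{16} \\
\{\theta_2,\theta_3\} &\sim \theta_2 \cup \theta_3 = \alpha_{17} \\
\{\theta_1,\theta_2,\theta_3\} &\sim \theta_1 \cup \theta_2 \cup \theta_3 = \alpha_{18} \\
\{\theta_1\}\times\{\theta_2\} &\sim \theta_1 \cap \theta_2 = \alpha_2 \\
\{\theta_1\}\times\{\theta_3\} &\sim \theta_1 \cap \theta_3 = \alpha_3 \\
\{\theta_2\}\times\{\theta_3\} &\sim \theta_2 \cap \theta_3 = \alpha_4 \\
\{\theta_1\}\times\{\theta_2,\theta_3\} = \{\{\theta_1\}\times\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}\} &\sim \theta_1 \cap (\theta_2 \cup \theta_3) = \alpha_7 \\
\{\theta_2\}\times\{\theta_1,\theta_3\} = \{\{\theta_1\}\times\{\theta_2\}, \{\theta_2\}\times\{\theta_3\}\} &\sim \theta_2 \cap (\theta_1 \cup \theta_3) = \alpha_6 \\
\{\theta_3\}\times\{\theta_1,\theta_2\} = \{\{\theta_3\}\times\{\theta_1\}, \{\theta_3\}\times\{\theta_2\}\} &\sim \theta_3 \cap (\theta_1 \cup \theta_2) = \alpha_5 \\
\{\theta_1\}\times\{\theta_2\}\times\{\theta_3\} &\sim \theta_1 \cap \theta_2 \cap \theta_3 = \alpha_1 \\
\{\{\theta_1\}\times\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}, \{\theta_2\}\times\{\theta_3\}\} &\sim (\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3) = \alpha_8 \\
\Box\theta_1 = \{\{\theta_1\}, \{\theta_2\}\times\{\theta_3\}\} &\sim \theta_1 \cup (\theta_2 \cap \theta_3) = \alpha_{14} \\
\Box\theta_2 = \{\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}\} &\sim \theta_2 \cup (\theta_1 \cap \theta_3) = \alpha_{13} \\
\Box\theta_3 = \{\{\theta_3\}, \{\theta_1\}\times\{\theta_2\}\} &\sim \theta_3 \cup (\theta_1 \cap \theta_2) = \alpha_{12}.
\end{aligned}$$


Thus a generalized frame of discernment from the minC approach uniquely corresponds to $D^\Theta - \emptyset$.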
































Hence the minC approach is an alternative way to generate Dedekind's lattice.


10.4.2 Comparison of principles of combination


For bbms of two non-conflicting sets $X, Y \subseteq \Theta$ both the minC and the DSm rules assign the product of the belief masses to the intersection of the sets⁶. If one of the sets (or both of them) is conflicting, then the minC combination assigns the product of their bbms to the conflict $X \times Y$. Similarly as above, we can consider this conflict as an intersection $X \cap Y$. We should verify whether $X \cap Y$ really corresponds to the corresponding field of the minC combination table.


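As a first example, let us denote $A_1 \triangleq \{\theta_1,\theta_3\}\times(\{\theta_3\}\times\{\theta_1,\theta_2\})$; then one has

$$\begin{aligned}
A_1 &\sim (\theta_1 \cup \theta_3) \cap (\theta_3 \cap (\theta_1 \cup \theta_2)) = (\theta_1 \cap (\theta_3 \cap (\theta_1 \cup \theta_2))) \cup (\theta_3 \cap (\theta_3 \cap (\theta_1 \cup \theta_2))) \\
&= (\theta_3 \cap (\theta_1 \cap (\theta_1 \cup \theta_2))) \cup (\theta_3 \cap (\theta_1 \cup \theta_2)) = (\theta_3 \cap \theta_1) \cup (\theta_3 \cap (\theta_1 \cup \theta_2)) = \theta_3 \cap (\theta_1 \cup \theta_2) \\
&\sim \{\theta_3\}\times\{\theta_1,\theta_2\}.
\end{aligned}$$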


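As a second example, let us denote $A_2 \triangleq (\{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}) \times \{\{\theta_1\}\times\{\theta_2\}, \{\theta_1\}\times\{\theta_3\}, \{\theta_2\}\times\{\theta_3\}\}$; then one has

$$\begin{aligned}
A_2 &\sim (\theta_1 \cap \theta_2 \cap \theta_3) \times ((\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3)) \\
&\sim (\theta_1 \cap \theta_2 \cap \theta_3) \cap ((\theta_1 \cap \theta_2) \cup (\theta_1 \cap \theta_3) \cup (\theta_2 \cap \theta_3)) \\
&= (\theta_1 \cap \theta_2 \cap \theta_3) \cup (\theta_1 \cap \theta_2 \cap \theta_3) \cup (\theta_1 \cap \theta_2 \cap \theta_3) \\
&= \theta_1 \cap \theta_2 \cap \theta_3 \sim \{\theta_1\}\times\{\theta_2\}\times\{\theta_3\}.
\end{aligned}$$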











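As a third example, let us denote $A_3 \triangleq \Box\theta_1 \times (\{\theta_1\}\times\{\theta_2,\theta_3\})$; then one has

$$\begin{aligned}
A_3 &= \{\{\theta_1\}, \{\theta_2\}\times\{\theta_3\}\} \times (\{\theta_1\}\times\{\theta_2,\theta_3\}) \\
&\sim (\theta_1 \cup (\theta_2 \cap \theta_3)) \cap (\theta_1 \cap (\theta_2 \cup \theta_3)) \\
&= (\theta_1 \cap (\theta_1 \cap (\theta_2 \cup \theta_3))) \cup ((\theta_2 \cap \theta_3) \cap (\theta_1 \cap (\theta_2 \cup \theta_3))) \\
&= (\theta_1 \cap (\theta_2 \cup \theta_3)) \cup ((\theta_2 \cap \theta_3) \cap (\theta_1 \cap (\theta_2 \cup \theta_3))) \\
&= (\theta_1 \cap (\theta_2 \cup \theta_3)) \cup ((\theta_2 \cap \theta_3 \cap \theta_1 \cap \theta_2) \cup (\theta_2 \cap \theta_3 \cap \theta_1 \cap \theta_3)) \\
&= (\theta_1 \cap (\theta_2 \cup \theta_3)) \cup ((\theta_1 \cap \theta_2 \cap \theta_3) \cup (\theta_1 \cap \theta_2 \cap \theta_3)) \\
&= (\theta_1 \cap (\theta_2 \cup \theta_3)) \cup (\theta_1 \cap \theta_2 \cap \theta_3) = \theta_1 \cap (\theta_2 \cup \theta_3) \sim \{\theta_1\}\times\{\theta_2,\theta_3\}.
\end{aligned}$$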
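The three identities above can be checked mechanically with a short Python sketch. It assumes a concrete model of the free lattice that is not used in the chapter itself: each $\theta_i$ is represented by the set of parts of a 3-set Venn diagram (all overlaps non-empty) lying inside it, so that $\cup$ and $\cap$ become ordinary set operations. The names are illustrative.

```python
from itertools import combinations

THETA = (1, 2, 3)
# One Venn-diagram part per non-empty index set
REGIONS = [frozenset(c) for r in range(1, 4) for c in combinations(THETA, r)]

def theta(i):
    """theta_i as the set of Venn parts lying inside it."""
    return frozenset(r for r in REGIONS if i in r)

t1, t2, t3 = theta(1), theta(2), theta(3)

# A1 = {t1,t3} x ({t3} x {t1,t2})
a1 = (t1 | t3) & (t3 & (t1 | t2))
# A2 = ({t1} x {t2} x {t3}) x {{t1}x{t2}, {t1}x{t3}, {t2}x{t3}}
a2 = (t1 & t2 & t3) & ((t1 & t2) | (t1 & t3) | (t2 & t3))
# A3 = (potential conflict t1 U (t2 n t3)) x ({t1} x {t2,t3})
a3 = (t1 | (t2 & t3)) & (t1 & (t2 | t3))

checks = {
    "A1": a1 == t3 & (t1 | t2),
    "A2": a2 == t1 & t2 & t3,
    "A3": a3 == t1 & (t2 | t3),
}
```

All three entries of `checks` come out `True` in this model, in agreement with the derivations.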


⁶ We have to mention here that the minC combination rule has never been formulated as a k-ary operator for combination of $k \ge 2$ belief sources, analogically to the DSm combination rule, see Equations (12) and (45). Nevertheless, it is theoretically very easy to formulate it explicitly, similarly to the DSm rule for k sources. Moreover, because of its associativity on the generalized level we can obtain the same result by step-wise ($(k-1)$-times) application of the binary form, and continue with reallocation of bbms of conflicts as usual.
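In the case of

$$\begin{aligned}
\{\theta_1,\theta_3\}\times\{\theta_1,\theta_2\} &\sim (\theta_1 \cup \theta_3) \times (\theta_1 \cup \theta_2) \sim (\theta_1 \cup \theta_3) \cap (\theta_1 \cup \theta_2) = (\theta_1 \cap (\theta_1 \cup \theta_2)) \cup (\theta_3 \cap (\theta_1 \cup \theta_2)) \\
&= ((\theta_1 \cap \theta_1) \cup (\theta_1 \cap \theta_2)) \cup ((\theta_3 \cap \theta_1) \cup (\theta_3 \cap \theta_2)) = \theta_1 \cup (\theta_3 \cap \theta_1) \cup (\theta_3 \cap \theta_2) \\
&= \theta_1 \cup (\theta_3 \cap \theta_2) \sim \{\{\theta_1\}, \{\theta_2\}\times\{\theta_3\}\} \sim \Box\theta_1
\end{aligned}$$

we can show again that the minC combination of bbms of the sets $\{\theta_1,\theta_3\}$, $\{\theta_1,\theta_2\}$ corresponds to the intersection of the corresponding elements of $D^\Theta$: $(\theta_1 \cup \theta_3)$ and $(\theta_1 \cup \theta_2)$, i.e. to $\theta_1 \cup (\theta_3 \cap \theta_2)$. Moreover, this shows the rise and the importance of a partial conflict (or potential contradiction) between two sets with a non-empty intersection $\{\theta_1,\theta_3\} \cap \{\theta_1,\theta_2\} = \{\theta_1\}$ in Shafer's model. This intersection $\{\theta_1\}$, which is used in Dempster's rule, is different from the generalized minC and the free DSm intersection $\{\theta_1,\theta_3\} \cap \{\theta_1,\theta_2\} \sim (\theta_1 \cup \theta_3) \cap (\theta_1 \cup \theta_2) = \theta_1 \cup (\theta_3 \cap \theta_2) \sim \Box\theta_1$ on the generalized level.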








Analogically we can verify that all the fields in the complete minC combination table uniquely correspond to intersections of the corresponding sets. For a general nD case it is possible to verify that the similarity relation $\sim$ on conflicts corresponds with the properties of the lattice $(\Theta, \cap, \cup)$. Thus the minC combination equation (10.1) corresponds with the classical DSm combination equation (11).


Hence the minC combination on a generalized level fully corresponds to the DSm combination rule on a free DSm model.
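This correspondence suggests a compact way to sketch the combination on the generalized level in Python: the product $m_1(X)m_2(Y)$ is assigned to the lattice intersection of $X$ and $Y$. The sketch assumes the Venn-part encoding of generalized elements as an illustration device (each $\theta_i$ is a set of Venn-diagram parts), and the input masses and helper names are hypothetical; it is not the table-based procedure of the original minC papers.

```python
from itertools import combinations, product

THETA = (1, 2, 3)
# One Venn-diagram part per non-empty index set
REGIONS = [frozenset(c) for r in range(1, 4) for c in combinations(THETA, r)]

def theta(i):
    """theta_i as the set of Venn parts lying inside it."""
    return frozenset(r for r in REGIONS if i in r)

def combine_generalized(m1, m2):
    """Generalized level: m1(X)*m2(Y) is assigned to the lattice intersection X n Y."""
    out = {}
    for (x, mx), (y, my) in product(m1.items(), m2.items()):
        out[x & y] = out.get(x & y, 0.0) + mx * my
    return out

def close(ma, mb, tol=1e-12):
    """Compare two generalized bbas up to floating-point rounding."""
    return all(abs(ma.get(k, 0.0) - mb.get(k, 0.0)) < tol for k in set(ma) | set(mb))

t1, t2, t3 = theta(1), theta(2), theta(3)
# Hypothetical classical bbas written as generalized elements
m1 = {t1: 0.5, t1 | t3: 0.5}
m2 = {t2: 0.4, t1 | t2: 0.6}
m3 = {t3: 0.7, t1 | t2 | t3: 0.3}

m12 = combine_generalized(m1, m2)
left = combine_generalized(m12, m3)
right = combine_generalized(m1, combine_generalized(m2, m3))
associative = close(left, right)
```

In `m12` the product $m_1(\{\theta_1,\theta_3\})\,m_2(\{\theta_1,\theta_2\})$ lands on the potential conflict $\theta_1 \cup (\theta_2 \cap \theta_3)$, and `associative` confirms that the order of combination does not matter on this level for the chosen inputs.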


10.4.3 Two steps of combination 


Because minC is not designed for the DSm model but for the classical Shafer's model, we have to compare it in the context of the special Shaferian case of the hybrid DSm rule. According to the present state of development of the hybrid DSm rule, see Chapter 4, in the first step all the combination is done on the free DSm model (it is fully equivalent to the generalized minC combination) and in the second step constraints are introduced. The second step is analogous to the reallocation in the minC approach. It does not explicitly distinguish anything like partial conflicts and pure conflicts, but analogically to the minC combination, bbms are reallocated in two different ways. An introduction of constraints can join two or more elements of $D^\Theta$, e.g. see Example 4 in Chapter 4, where the element $\alpha_9$ is joined with the element $\alpha_{14}$, and the elements $\alpha_{10}$ and $\alpha_{11}$ are joined with $\alpha_{13}$ and $\alpha_{12}$ respectively. Gbbms of such elements are actually reallocated within this process. Indeed, the gbbms $m_{\mathcal{M}^f}(\alpha_9)$, $m_{\mathcal{M}^f}(\alpha_{10})$, and $m_{\mathcal{M}^f}(\alpha_{11})$ are reallocated to $m_{\mathcal{M}^0}(\alpha_{14})$, $m_{\mathcal{M}^0}(\alpha_{13})$ and $m_{\mathcal{M}^0}(\alpha_{12})$ respectively, as an analogy of the reallocation of partial conflicts in the minC approach. We can verify that the elements $\alpha_9$, $\alpha_{10}$, $\alpha_{11}$ really correspond to the partial conflicts of the minC approach. Step 2 further consists in grouping all empty sets together and in the reallocation of their bbms. This action fully corresponds to a proportionalization of pure conflicts in the minC approach.


⁷ For a comparison of the minC combination with other approaches for combination of conflicting beliefs, see [8].


Hence, the only principal difference between the minC and the DSm combination rules consists in the reallocation of the bbms of conflicting (or empty) sets to non-conflicting (non-empty) ones, i.e. to the subsets of the frame of discernment, because the reallocation performed in the 2nd step of the hybrid DSm combination does not correspond to any of the above proportionalizations used in minC either.


10.4.4 On the associativity of the combination rules 


As was already mentioned, both the DSm rule and the minC combination rule are fully associative on the generalized level, i.e. on the free DSm model in DSm terminology. Step 2 in both combinations, i.e. the introduction of constraints in the DSm combination and the reallocation of conflicts including both the proportionalizations, does not keep associativity. If we use results of combination with all the constraints as an input for another combination, we obtain suboptimal results, see Section 4.5.5 in Chapter 4.


In order to keep as much associativity of the combination on the generalized level as possible, we have to use the n-ary version of the DSm rule. In the case where k input beliefs have already been combined, we have to save all the k input belief functions. If we want to combine the previous result with the new (k+1)th input $m_{k+1}$, then we have either to repeat the whole n-ary combination, this time for k+1 inputs, or we can use the free DSm result of the previous combination (the result of the last application of Step 1) and apply the binary Step 1 to combine the new input (we obtain the same result as with an application of the n-ary version for k+1 inputs). Nevertheless, after that we have to apply the n-ary version of Step 2 for the introduction of all constraints at the end.


The situation is different in the case of the minC combination. Because we consider only minimal conflicts, the result of Step 2 depends only on the generalized result $m^0$ of Step 1, and we do not need the input belief functions for the reallocation of partial conflicts and for the proportionalization. The non-normalized combination rule, including the generalized one, provides the same result whether the n-ary version is used for k inputs or the binary version is applied step-wise k−1 times. Hence the binary version of the generalized minC combination and the unary reallocation suffice for the optimal results in the sense of Chapter 4. If we already have k inputs combined, it is enough to save and store only the generalized result instead of all inputs. We then perform the generalized combination with the input $m_{k+1}$, and in the end we perform Step 2 to obtain the classical Shaferian result. Of course it is also possible to store all the inputs and to make a new combination, analogically to the DSm approach.


10.4.5 The special cases 


In particular, in the 2D case minC corresponds to Dempster's rule: there is only one type of conflict, and both the presented proportionalizations a) and b) coincide with normalization there. The 2D DSm, in contrast, corresponds to Yager's rule, see [12], where $m_1(X)m_2(Y)$ is assigned to $X \cap Y$ if it is non-empty or to $\Theta$ for $X \cap Y = \emptyset$, and it also coincides with Dubois-Prade's rule, see [7], where $m_1(X)m_2(Y)$ is assigned to $X \cap Y$ if it is non-empty or to $X \cup Y$ otherwise. To complete the 2D comparison, it is necessary to add that the classical DSm combination rule for the 2D free DSm model corresponds to the non-normalized Dempster's rule used in TBM. For examples see Table 10.3 in Section 10.5.
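The 2D coincidences can be illustrated by a short Python sketch on Shafer's model. The frame $\{a, b\}$, the input masses and the helper `combine` are assumptions made for this illustration; minC in 2D is sketched directly as the proportionalization of the single conflict $\{a\}\times\{b\}$ among $\{a\}$, $\{b\}$ and $\{a,b\}$.

```python
from itertools import product

A, B = frozenset("a"), frozenset("b")
THETA = A | B
EMPTY = frozenset()

def combine(m1, m2, on_conflict):
    """Conjunctive combination; on_conflict(X, Y) says where a conflicting product goes."""
    out = {}
    for (x, mx), (y, my) in product(m1.items(), m2.items()):
        z = (x & y) or on_conflict(x, y)
        out[z] = out.get(z, 0.0) + mx * my
    return out

# Hypothetical input bbas
m1 = {A: 0.6, B: 0.1, THETA: 0.3}
m2 = {A: 0.2, B: 0.5, THETA: 0.3}

tbm = combine(m1, m2, lambda x, y: EMPTY)    # non-normalized Dempster's rule
yager = combine(m1, m2, lambda x, y: THETA)  # conflict goes to the whole frame
dp = combine(m1, m2, lambda x, y: x | y)     # Dubois-Prade: conflict goes to X U Y

k = tbm.get(EMPTY, 0.0)
rest = {x: v for x, v in tbm.items() if x}
dempster = {x: v / (1.0 - k) for x, v in rest.items()}
# minC in 2D: proportionalize the only conflict among {a}, {b}, {a, b}
total = sum(rest.values())
minc = {x: v + k * v / total for x, v in rest.items()}
```

With these inputs `yager` and `dp` carry the same masses, because the only conflicting pairs are the two singletons and their union is the whole frame, and `minc` agrees with `dempster` up to rounding.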


In an nD case for n > 2 neither the minC nor the DSm rule corresponds to any version of Dempster's or Yager's rules. On the other hand, the binary version of the hybrid DSm rule coincides with Dubois-Prade's rule on Shafer's model, for an example see Table 10.6 in Section 10.5.


10.4.6 Comparison of expressivity of DSm and minC approaches 


As the minC combination is designed for combination of classical belief functions on frames of discernment with exclusive elements, we cannot explicitly express that 2 elements of the frame have a non-empty intersection. The only way to do so is through a generalized result of combination of 2 classical BFs. On the other hand, even if the hyper-power set $D^\Theta$ has more elements than the number of parts in the corresponding Venn diagram, in the DSm approach we cannot assign belief mass to $\theta_1$ but not to $\theta_2$. I.e. we cannot assign bbms in such a way that for the generalized pignistic probability, see Chapter 7, the following holds: $P(\theta_1) > 0$ and $P(\theta_2) = 0$. The intersection $\theta_1 \cap \theta_2$ is always a subset of both $\theta_1$ and $\theta_2$. Hence from $m(\theta_1) > 0$ we always obtain $P(\theta_1 \cap \theta_2) > 0$ and $P(\theta_2) > 0$. We cannot assign any gbbm to $\theta_1 - \theta_2$. The only way to do it is to add an additional constraint $\theta_1 \cap \theta_2 = \emptyset$, but such a constraint should be applied to all beliefs in the model and not only to one or several specific ones. As Shafer's model already has all the exclusivity constraints, the above described property is not related to it. Hence both the DSm approach and the minC combination have comparable expressivity on Shafer's model. The DSm approach utilizes, in addition to it, its capability to express positive belief masses of the intersections.


10.5 Examples 


In this section we present a comparison on examples of combination. The first 2D example compares not only the DSm and minC combination rules but also both the normalized and non-normalized Dempster's rule, Yager's rule, and Dubois-Prade's rule of belief combination, see Table 10.3. Because the proportionalizations a) and b) coincide in the 2D case, and subsequently the corresponding bbas $m^{a)}$ and $m^{b)}$ also coincide, we use $m^{minC}$ for $m^{a)} = m^{b)}$. This example enables us to make a wide comparison, but it does not really reveal the nature of the presented approaches to belief combination. For this reason we present also a more complicated 3D example, see Tables 10.4 and 10.5, which show us how conflicts and partial conflicts arise during combination, how constraints are introduced, and how proportionalizations are performed.


Table 10.3: Comparison of combination of 2D belief functions


Table 10.4 provides a comparison of combination of 3D belief functions based on the free DSm model with the classic DSm rule and on Shafer's model with the hybrid DSm rule. The 5th column ($m^{\mathcal{M}^f}_{12}$) gives the result of the combination of the sources 1 and 2 obtained with the classic DSm rule based on the free DSm model. The 7th column ($m^{\mathcal{M}^f}_{123}$) gives the result of the combination of the sources 1, 2 and 3 obtained with the classic DSm rule based also on the free DSm model. Column 6 ($m^{\mathcal{M}^0}_{12}$) presents the result of the hybrid DSm combination of sources 1 and 2 based on Shafer's model $\mathcal{M}^0$. Column 8 ($m^{\mathcal{M}^0}_{123}$) presents the result of the hybrid DSm combination of sources 1, 2 and 3 based on Shafer's model $\mathcal{M}^0$. Columns 9 and 10 show the results obtained when performing suboptimal fusion. $\oplus$ stands for the DSm rule on the free DSm model and blank fields stand for 0.


Table 10.5 presents the results drawn from the minC combination rule. $m^0$ corresponds to the gbba on the generalized frame of discernment, $m^1$ to the gbba after reallocation of bbms of partial conflicts, $m^{a)}$ to the bba after proportionalization a) and $m^{b)}$ to the bba after proportionalization b). $m^{a)}_{12 \circledast 3}$ denotes $(m^{a)}_{12} \circledast m_3)^{a)}$, and $m^{b)}_{12 \circledast 3}$ denotes $(m^{b)}_{12} \circledast m_3)^{b)}$, where $\circledast$ stands for the generalized minC combination; blank fields stand for 0.


Table 10.6 presents the results of several rules of combination for 3D belief functions for sources 1 and 2 on Shafer's model, i.e. on the hybrid DSm model $\mathcal{M}^0$ (for the source bbas $m_1$, $m_2$, and $m_3$ see Table 10.4). $m^{a)}$ corresponds to the bba of the minC combination (the minC combination of $m_1$ and $m_2$, or of $m_1$, $m_2$ and $m_3$ respectively) with proportionalization a); $m^{b)}$ corresponds to the bba of the minC combination with proportionalization b); $m^{DSm}$ corresponds to the bba of the DSm combination. $m^{TBM}$ corresponds to the bba of the combination with the TBM's non-normalized Dempster's rule; $m^{Y}$ corresponds to the bba of Yager's combination; $m^{DP}$ corresponds to the bba of Dubois-Prade's combination and $m^{\oplus}$ corresponds to the bba of the normalized Dempster's combination.


Table 10.4: Comparison of combination of 3D belief functions based on DSm rules of combination.





We can see that during the combination of 2 belief functions many types of conflict arise, but some of them still remain with 0 bbm (a, ~ x and ag ~ O). We can see how these conflicts arise when the 3rd BF is combined. We can see the difference between the combination of 3 BFs on the generalized level (see $m^0_{123}$) and the suboptimal combination of the 3rd belief with an intermediate result to which constraints have already been introduced (see the suboptimal-fusion columns of Tables 10.4 and 10.5). We can see how the gbbms are reallocated among the subsets of $\Theta$ during the second step of the minC combination and finally how the gbbms of all pure conflicts are reallocated in both ways a) and b).


The final results of the DSm and minC combinations are compared in Table 10.6. We can note that the small subsets of $\Theta$ (singletons in our 3D example) have greater bbms after the minC combination, while the great sets (2-element sets and namely the whole $\{\theta_1, \theta_2, \theta_3\}$ in our case) have greater bbms after application of the DSm combination rule. I.e. the DSm combining rule is more cautious than the minC combination within the reallocation of the conflicting gbbms. Thus we see that the minC combination rule produces more specific results than the DSm rule does. The last three columns of the table show us that the DSm rule and the minC rule with both proportionalizations produce results different from those of Yager's rule and of both versions of Dempster's rule (see $m^{Y}$, $m^{TBM}$, and $m^{\oplus}$ respectively). The binary DSm result on Shafer's model ($\mathcal{M}^0$), however, coincides with the result of Dubois-Prade's rule of combination.

Table 10.5: Comparison of combination of 3D belief functions with the minC rule.

Let us present numeric examples of parts of the computation of $m^0$, $m^1$, $m^{a)}$, and $m^{b)}$ for readers who are interested in detail. We begin with a non-conflicting set $\{\theta_1, \theta_2\}$, i.e. with $\alpha_{15} = \theta_1 \cup \theta_2$ in the DSm notation. It is an intersection with itself or with the whole $\Theta = \{\theta_1, \theta_2, \theta_3\}$ (i.e. $\theta_1 \cup \theta_2 \cup \theta_3$ in DSm), and it is not $\sim$ equivalent to any other element of $D^\Theta$. Thus $m^0_{12}(\theta_1 \cup \theta_2) = m_1(\theta_1 \cup \theta_2)\,m_2(\theta_1 \cup \theta_2) + m_1(\theta_1 \cup \theta_2)\,m_2(\theta_1 \cup \theta_2 \cup \theta_3) + m_1(\theta_1 \cup \theta_2 \cup \theta_3)\,m_2(\theta_1 \cup \theta_2) = 0.1 \cdot 0.0 + 0.1 \cdot 0.3 + 0.0 \cdot 0.2 = 0.00 + 0.03 + 0.00 = 0.03$. $\alpha_{15}$ is a non-conflicting element of $D^\Theta$, hence it is not further reassigned or proportionalized, i.e. its bbm will not be decreased. $\alpha_{15}$ is not a non-conflicting part of any other element of $D^\Theta$, thus $m^1_{12}(\alpha_{15}) = m^0_{12}(\alpha_{15})$. $m^{a)}_{12}(\alpha_{15}) > m^1_{12}(\alpha_{15})$ because gbbms of some other elements are proportionalized, among others, also to $\alpha_{15}$. For the same reason it also holds that $m^{b)}_{12}(\alpha_{15}) > m^1_{12}(\alpha_{15})$.

Table 10.6: Comparison of combinations of sources 1 and 2 on Shafer's model (i.e. on the hybrid DSm model $\mathcal{M}^0$).












































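A potential conflict $\square\{\theta_1\} \sim (\theta_1 \cup \theta_2) \cap (\theta_1 \cup \theta_3) = \alpha_{14}$ is equivalent to $\square\{\theta_1\} \times \square\{\theta_1\}$, to $\square\{\theta_1\} \times X$, and to $X \times \square\{\theta_1\}$, where $\{\theta_1\} \subset X$ in Shafer's model, see Table 10.1; or $\alpha_{14} = (\theta_1 \cup \theta_2) \cap (\theta_1 \cup \theta_3)$ is an intersection of itself with $X$, where $\alpha_{14} \subset X \subseteq \theta_1 \cup \theta_2 \cup \theta_3$ in the DSm terminology. I.e.

$$m^0_{12}(\alpha_{14}) = m^0_{12}(\theta_1 \cup (\theta_2 \cap \theta_3)) = m_1(\alpha_{14})\,m_2(\alpha_{14}) + m_1(\theta_1 \cup \theta_2)\,m_2(\theta_1 \cup \theta_3) + m_1(\theta_1 \cup \theta_3)\,m_2(\theta_1 \cup \theta_2) + m_1(\alpha_{14})\big(m_2(\theta_1 \cup \theta_2) + m_2(\theta_1 \cup \theta_3) + m_2(\theta_1 \cup \theta_2 \cup \theta_3)\big) + \big(m_1(\theta_1 \cup \theta_2) + m_1(\theta_1 \cup \theta_3) + m_1(\theta_1 \cup \theta_2 \cup \theta_3)\big)\,m_2(\alpha_{14})$$
$$= 0.0 \cdot 0.0 + 0.1 \cdot 0.1 + 0.1 \cdot 0.0 + 0.0 \cdot (0.1 + 0.1 + 0.2) + (0.0 + 0.1 + 0.3) \cdot 0.0 = 0 + 0.01 + 0 + 0 + 0 = 0.01.$$

$\alpha_9 = \theta_1$ is a non-conflicting part of $\theta_1 \cup (\theta_2 \cap \theta_3)$, thus $m^0_{12}(\alpha_{14})$ is reallocated to $\theta_1$. On the other hand, $\theta_1$ is not a non-conflicting part of any other element of $D^\Theta$, hence $m^1(\alpha_9) = m^0(\alpha_9) + m^0(\alpha_{14}) = 0.19 + 0.01 = 0.20$.

After this reallocation, the bbm of $\alpha_{14}$ equals 0, hence $m^1(\alpha_{14}) = m^{a)}(\alpha_{14}) = m^{b)}(\alpha_{14}) = 0$.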


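A pure conflict $\{\theta_1\} \times \{\theta_2, \theta_3\} \sim \theta_1 \cap (\theta_2 \cup \theta_3) = \alpha_7$ is contained in 24 fields of the full minC combination table (for a part of it see Table 10.1), e.g. in the fields corresponding to $\{A\} \times (\{A\} \times \{B,C\})$, $\{A\} \times \{B,C\}$, $\{A,B\} \times (\{A\} \times \{B,C\})$, but only some of them correspond to the Shaferian input beliefs (i.e. only some of them are positive). Thus $m^1(\alpha_7) = m^0(\alpha_7) = m_1(\theta_1)\,m_2(\theta_2 \cup \theta_3) + m_1(\theta_2 \cup \theta_3)\,m_2(\theta_1) = 0.3 \cdot 0.2 + 0.1 \cdot 0.0 = 0.06 + 0.00 = 0.06$. As $\alpha_7$ is a pure conflict, its bbm does not change during the reallocation substep, and it is proportionalized among $\{\theta_1\}$, $\{\theta_2, \theta_3\}$, $\{\theta_1, \theta_2, \theta_3\}$ with proportionalization a), and among all the subsets of $\Theta = \{\theta_1, \theta_2, \theta_3\}$ with proportionalization b). Thus

$$m^1(\alpha_7) \cdot \frac{m^1(\theta_1)}{m^1(\theta_1) + m^1(\theta_2 \cup \theta_3) + m^1(\theta_1 \cup \theta_2 \cup \theta_3)} = 0.06 \cdot \frac{0.20}{0.20 + 0.04 + 0.06} = 0.06 \cdot \frac{0.20}{0.30} = 0.040$$

is reassigned to $\theta_1 = \alpha_9$;

$$m^1(\alpha_7) \cdot \frac{m^1(\theta_2 \cup \theta_3)}{m^1(\theta_1) + m^1(\theta_2 \cup \theta_3) + m^1(\theta_1 \cup \theta_2 \cup \theta_3)} = 0.06 \cdot \frac{0.04}{0.20 + 0.04 + 0.06} = 0.06 \cdot \frac{0.04}{0.30} = 0.008$$

is reassigned to $\theta_2 \cup \theta_3 = \alpha_{17}$; and

$$m^1(\alpha_7) \cdot \frac{m^1(\theta_1 \cup \theta_2 \cup \theta_3)}{m^1(\theta_1) + m^1(\theta_2 \cup \theta_3) + m^1(\theta_1 \cup \theta_2 \cup \theta_3)} = 0.06 \cdot \frac{0.06}{0.20 + 0.04 + 0.06} = 0.06 \cdot \frac{0.06}{0.30} = 0.012$$

is reassigned to $\theta_1 \cup \theta_2 \cup \theta_3 = \alpha_{18}$ with proportionalization a). As the belief masses $0.05 \cdot \frac{0.20}{0.40} = 0.05 \cdot 0.5 = 0.0250$ and $0.07 \cdot \frac{0.20}{0.20 + 0.16 + 0.06} = 0.07 \cdot 0.4762 = 0.0333$ are analogously proportionalized with proportionalization a) also to $\theta_1$, we obtain $m^{a)}(\theta_1) = m^1(\theta_1) + 0.040 + 0.0250 + 0.0333 = 0.2000 + 0.040 + 0.0250 + 0.0333 = 0.2983$. The value $m^{b)}(\theta_1)$ is computed analogously, where e.g. $0.06 \cdot \frac{0.20}{0.20 + 0.17 + 0.16 + 0.03 + 0.06 + 0.04 + 0.06} = 0.06 \cdot \frac{0.20}{0.72} = 0.06 \cdot 0.2777 = 0.0166$ is proportionalized from $m^1(\alpha_7)$.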
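The arithmetic of proportionalization a) for the pure conflict $\alpha_7$ can be replayed with a few lines of Python. This is only a minimal sketch: the helper name `proportionalize` and the string labels are hypothetical, and the input masses are simply the $m^1$ values quoted in the worked example above.

```python
def proportionalize(conflict_mass, target_masses):
    """Split conflict_mass among the targets proportionally to their masses."""
    total = sum(target_masses.values())
    if total == 0:
        raise ValueError("cannot proportionalize among targets of zero total mass")
    return {name: conflict_mass * mass / total
            for name, mass in target_masses.items()}


# m^1(alpha_7) = 0.06 is shared among theta1, theta2 U theta3 and Theta,
# whose m^1 masses in the worked example are 0.20, 0.04 and 0.06.
shares = proportionalize(
    0.06,
    {"theta1": 0.20, "theta2 U theta3": 0.04, "Theta": 0.06},
)

for name, share in shares.items():
    print(f"{name}: {share:.3f}")
```

The three shares add up to the original conflicting mass 0.06, so the reallocation preserves the total mass.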


10.6 Conclusion 


In this chapter we have compared two independently developed approaches to combination of conflicting 
beliefs. Motivations and the starting points of the approaches are significantly different. The classical 
frame of discernment with mutually exclusive elements is the starting point for the minC combination, 
whereas the free DSm model is the starting point for the classical DSm approach. The approaches were 
originally rather complementary than comparable. 

Surprisingly, the internal combining structures and mechanisms of both these combination rules are 
the same and the results of the classical DSm rule for the free DSm model are the same as the intermediate 
results of the minC combination on a generalized frame of discernment. Nevertheless, this common step 
is followed by reallocation of the belief masses temporarily assigned to conflicts to obtain classical belief 
functions as results in the case of the minC combination. 

After the recent development of versions of the DSm rule for Shafer’s model and for general hybrid 
DSm models, which consider 2 steps of combination, the minC combination becomes an alternative to 
the special case of the DSm combination rule for Shafer’s model. 

The first step, a combination on a generalized frame, is again the same. The reallocation of the generalized basic belief masses of potential conflicts is also analogous. The main difference lies in the reallocation of the generalized basic belief masses (gbbms) of pure conflicts: the DSm rule reassigns these gbbms to the union of the corresponding sets, whereas the minC approach proportionalizes them.

In spite of this difference, we can also consider the DSm introduction of constraints as an alternative to a reallocation of the belief masses of conflicts in the minC approach.
Denis de Brucq
Académie des Sciences, Belles-Lettres et Arts de Rouen
Rouen, France.


Abstract: This chapter presents new important links between the most important theories developed in the literature for managing uncertainties (i.e. probability, fuzzy sets and evidence theories). Information fusion introduces special operators $\circ$ in probability theory, in fuzzy set theory and in the theory of evidence. The mathematical theory of evidence and fuzzy set theory often replace probabilities in medicine, economics and automatic control. The choice between these three quite distinct theories depends on the intrinsic nature of the data to combine. This chapter shows that the same four postulates actually support these apparently distinct theories. We unify these three theories from the following four postulates: non-contradiction, continuity, universality and context dependence, and prove that the same functional equation is supported by probability theory, evidence theory and fuzzy set theory. In other words, the same postulates applied to confidences, under different conditions, either in the dependence or in the independence situation, imply the same foundation for the various modern theories of information fusion in the framework of uncertainty, by using deductions that we have unified. The independence between elementary confidences is not to be understood in the probabilistic sense.

11.1 About uncertainty


In medical fields, as in economics and control, one notes the limitation of additive probabilities due to the too strong constraints imposed. The modification of the basic axioms to overcome these limitations leads to different numerical theories, and one finds approaches such as fuzzy set theory. By considering the notion of lower probabilities and upper probabilities, one obtains the credibility and the plausibility functions of Dempster-Shafer's theory of evidence [6]. The 1960s saw the development of theories that are not directly linked to probabilities. For instance, Zadeh invented fuzzy set theory in 1965 [15]; he then created possibility theory in 1978 [16].


With the four postulates, which are the basis of the machines on confidences, without adding the additivity postulate that leads to probabilities, and by considering the independence of the achievement of these confidences, we obtain fuzzy set theory.

In fact, we have observed that both basic equalities of information fusion are two continuous, commutative and associative operations on confidences. Let $\Theta$ be a discrete body of evidence called the frame of discernment. Both combinations can then be written in terms of probabilities:

$$\forall A, B \subset \Theta, \quad P(A \cap B) \triangleq P(A)\,P(B/A) \triangleq P(B)\,P(A/B)$$

and in terms of membership functions:

$$\forall A, B \subset \Theta, \quad \mu_{A \cap B}(x) \triangleq \mu_A(x) \wedge \mu_B(x)$$

These two operations have to satisfy the same basic postulates required to model data fusion.


When analyzing imprecise and uncertain data, all the usual techniques must be changed. It is a fact that logic is only an abstract construction for reasoning and that physical laws are only models of the evolution of material systems. Nothing proves that logic can correctly describe all fusions. Moreover, imprecise and uncertain analyses such as those of this chapter show that an infinity of fusions are possible. From the principles of this chapter, it is possible to introduce a fusion denoted by the operator $\circ$ with any increasing function from [0,1] onto [0,1]. More precisely, with two beliefs $x, y$, instead of the product $x \cdot y$ we write $x \circ y$ to describe the fusion. For example, instead of the probability $P(A \cap B) = P(A)P(B)$ of the intersection $A \cap B$ of two independent sets $A, B$, we write the belief $[A \text{ and } B/e] = [A/e] \circ [B/e]$, the fusion $\circ$ of the two beliefs $[A/e]$ and $[B/e]$. Any equation of this book may be changed with this transformation.

Moreover, the hypothesis that the sum of masses of disjoint sets is equal to 1 is a global hypothesis and seems to be hazardous.

We demonstrate that the fusion operation $\circ$ is mainly described by a simple product after transformation. This preliminary transformation of the confidence $c(A) = [A/e]$ in $A$ within the environment $e$ is made by using a continuous and strictly monotone function $w$. This result is easily understood by comparing the transformation $w$ with the Fourier transformation. The latter transforms the convolution product of two functions into the product of their Fourier transforms. We observe that convolution is commutative and associative. Similarly, Dempster-Shafer fusion is also commutative and associative. The commonality of a fusion is the simple product of the commonalities of the sources. Without commutativity or associativity, other developments are necessary.


11.1.1 Probabilistic modelling 


Probability theory took a leap forward during the 17th century with the study of games of chance. The ultimate objective of probability theory is the study of the laws governing random phenomena, that is, the presence of uncertainty. For many years, probabilistic methods have generated many debates, in particular among defenders of the frequentist approach, the objective approach and the subjective approaches. Historically, the formulation of the axiomatic basis and the mathematical foundation of the theory are due to Andreï Kolmogorov in 1933.


Let an uncertain experiment be described by the sample space $\Omega$ whose elements, denoted $\omega$, are the possible results of that experiment. Let $A \in \mathcal{P}(\Omega)$ be a subset of $\Omega$. The subset $A$ is a random event for this theory, and the event is said to occur when the result $\omega$ of the experiment belongs to $A$. The collection of all the subsets of $\Omega$, $\mathcal{P}(\Omega)$, cannot always be associated to the set $\mathcal{A}$ of possible random events in $\Omega$. For logical coherence purposes, one restricts $\mathcal{A}$ to a $\sigma$-algebra, a subset of $\mathcal{P}(\Omega)$ which is closed under countable union and under complement. Thus, the pair $(\Omega, \mathcal{A})$ is a measurable space and a probability measure $P$ over $(\Omega, \mathcal{A})$ is then a positive real-valued function of sets with values in [0,1] and defined over $\mathcal{A}$.


Definition 1. A probability measure $P$ over $(\Omega, \mathcal{A})$ is an application of $\mathcal{A}$ with values in [0,1] satisfying the following axioms (Kolmogorov's axioms):

i) For all $A \in \mathcal{A}$,
$$0 \le P(A) \le 1 \quad \text{and} \quad P(\Omega) = 1 \tag{11.1}$$
ii) (additivity) For any finite family $\{A_i, i \in I\}$ of mutually exclusive events, we have:
$$P\Big(\bigcup_{i \in I} A_i\Big) = \sum_{i \in I} P(A_i) \tag{11.2}$$
iii) (sequential monotonic continuity in $\emptyset$) For any sequence $\{A_n, n \ge 1\}$ of events decreasing to the empty set $\emptyset$, that is $A_1 \supset A_2 \supset A_3 \supset \dots$ and $\bigcap_n A_n = \emptyset$, we have
$$\lim_{n \to \infty} P(A_n) = 0 \tag{11.3}$$

$P(A)$ characterizes the probability that the event $A$ occurs. If $P$ is a probability measure on $(\Omega, \mathcal{A})$, the triple $(\Omega, \mathcal{A}, P)$ is a probability space. From the previous axioms, one easily deduces the following properties:
$$A_1 \subseteq A_2 \implies P(A_1) \le P(A_2), \tag{11.4}$$
$$P(\emptyset) = 0, \tag{11.5}$$
$$P(\bar{A}) = 1 - P(A), \tag{11.6}$$
$$P(A_1 \cup A_2) = P(A_1) + P(A_2) - P(A_1 \cap A_2). \tag{11.7}$$


The conditional probability is one of the most useful notions in probability theory. In practice, it is 
introduced to allow reasoning on events of a referential. For instance, in the case of an exhaustive draw, 
it is concerned with the probability of an event A, under the condition that an event E occurs. The 
random event E represents the environment that is usually expressed as E = e. There is no reason for 


having symmetry between event A and the environment e. 


Definition 2. Let $(\Omega, \mathcal{A}, P)$ be a probability space. The conditional probability $P(A/E)$ of an event $A$ given $E$ such that $P(E) > 0$ is defined as:
$$P(A/E) = \frac{P(A \cap E)}{P(E)} \tag{11.8}$$

If $P(E) = 0$, this definition makes no sense. If $A \subset E$ then $P(A/E) = \frac{P(A)}{P(E)}$, and one has $P(E/E) = 1$.

Obviously, the conditional probability $P(A/E)$ will be seen as the probability of $A$ when $E$ becomes the certain event following additional information asserting that $E$ is realized ($P(E) = 1$).


Equation (11.8) is generalized by the well-known Bayes' theorem. Consider an event $E$ whose probability can be estimated a priori ($P(E) \ne 0$) and a finite partition $\{H_1, \dots, H_n\}$ of $\Omega$ (a set of mutually exclusive hypotheses describing $n$ modalities of the realization of $E$). Bayes' formula then yields:
$$P(H_i/E) = \frac{P(E/H_i)\,P(H_i)}{\sum_{j=1}^{n} P(E/H_j)\,P(H_j)} \tag{11.9}$$

The conditional probabilities (11.9) allow the modification of the a priori probability of event $H_i$, according to the new knowledge on the realization $E = e$.
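As an illustration of Bayes' formula (11.9), the following minimal Python sketch updates a priori probabilities over a finite partition of hypotheses. The function name `bayes_posterior` and the numerical priors and likelihoods are hypothetical values chosen only for the example.

```python
def bayes_posterior(priors, likelihoods):
    """Return P(H_i/E) for each hypothesis, following formula (11.9).

    priors[i]      is P(H_i), the a priori probability of hypothesis H_i
    likelihoods[i] is P(E/H_i)
    """
    joint = [p * l for p, l in zip(priors, likelihoods)]
    evidence = sum(joint)  # P(E), the denominator of (11.9)
    if evidence == 0:
        raise ValueError("P(E) = 0: the conditional probability is undefined")
    return [j / evidence for j in joint]


# Hypothetical partition {H_1, H_2, H_3} of the sample space.
priors = [0.5, 0.3, 0.2]
likelihoods = [0.1, 0.4, 0.7]
posterior = bayes_posterior(priors, likelihoods)

for i, p in enumerate(posterior, start=1):
    print(f"P(H_{i}/E) = {p:.4f}")
```

The posterior values sum to 1, and the hypothesis with the largest product $P(E/H_i)P(H_i)$ receives the largest posterior probability.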


Definition 3. Let $(\Omega, \mathcal{A}, P)$ be a probability space and let $A$ and $E$ be two events of $\mathcal{A}$. The events $A$ and $E$ are two independent events if and only if
$$P(A \cap E) = P(A)\,P(E). \tag{11.10}$$

Property 1. Let $(\Omega, \mathcal{A}, P)$ be a probability space and let $A$ and $E$ be two events of $\mathcal{A}$. If $P(E) > 0$, then $A$ and $E$ are two independent events if and only if
$$P(A/E) = P(A). \tag{11.11}$$

Thus, if $A$ and $E$ are two independent events and if $E$ is not impossible, then the probability of $A$ is not modified if one receives information on $E$ being realized.


11.1.2 The mathematical theory of evidence 


The evidence theory, or Dempster-Shafer's theory (DST) of belief functions, was born during a lecture on statistical inference given by Arthur Dempster at Harvard University during the 1960s. Dempster's main idea has been reinterpreted by Glenn Shafer in his book entitled "A Mathematical Theory of Evidence" [12].


Let us consider two spaces $\Omega$ and $\Theta$, and a multivalued relation $\Gamma$ associating the subset $\Gamma(\omega) \subset \Theta$ to each element $\omega \in \Omega$. Let us assume that $P$ is a probability measure defined on $(\Omega, \mathcal{A})$, where $\mathcal{A}$ is a $\sigma$-algebra of subsets of $\Omega$. Considering that $P$ represents the probability of occurrence of an uncertain event $\omega \in \Omega$, and if it is established that this event $\omega$ is in correspondence with the events $\theta \in \Gamma(\omega)$, what probability judgment can we make about the occurrence of uncertain events $\theta \in \Theta$?


Dempster's view is that the above consideration leads to the concept of compatible probability measures. He then refers to the envelope delimited by the lower probability and upper probability of this probability family.

The probability space $(\Omega, \mathcal{A}, P)$ is the information source which allows the quantification of the (imperfect) state of knowledge over the new referential $\Theta$ by means of $\Gamma$.

In this study, $(\Omega, P, \Gamma, \Theta)$ is called a belief structure. By using these mathematical tools, Shafer has proposed another interpretation of Dempster's work. This new interpretation identifies the lower and upper probabilities of the family of compatible probability measures as authentic confidence measures.


Definition 4. Let $\Theta$ be a finite space and $2^\Theta$ ($= \mathcal{P}(\Theta)$) the power set of $\Theta$. A credibility function¹ $\mathrm{Cr}$ is an application of $2^\Theta$ with values in [0,1] which satisfies the following conditions:

(i) $\mathrm{Cr}(\emptyset) = 0$,

(ii) $\mathrm{Cr}(\Theta) = 1$,

(iii) For every integer $n$ and every family of subsets $A_1, \dots, A_n$ of $\Theta$,
$$\mathrm{Cr}(A_1 \cup \dots \cup A_n) \ge \sum_{\substack{I \subseteq \{1, \dots, n\} \\ I \ne \emptyset}} (-1)^{|I|+1}\, \mathrm{Cr}\Big(\bigcap_{i \in I} A_i\Big) \tag{11.12}$$

¹ The belief function Cr is denoted Bel in [12].

The condition (iii) is called the general superadditivity condition. When $n = 2$, (11.12) becomes
$$\mathrm{Cr}(A_1 \cup A_2) \ge \mathrm{Cr}(A_1) + \mathrm{Cr}(A_2) - \mathrm{Cr}(A_1 \cap A_2). \tag{11.13}$$

The credibility function makes it possible to quantify the partial information in $\Theta$. In theory, other functions are associated to Cr, which are equivalent to it:

• The plausibility function, dual to the credibility function.

• The elementary probability mass function (also called basic belief assignment or mass function), which is obtained from the credibility function by means of the Möbius transform.


Definition 5. The basic belief assignment is the function $m : 2^\Theta \to [0,1]$ that satisfies the following property
$$\sum_{A \in 2^\Theta} m(A) = 1 \tag{11.14}$$
with
$$m(\emptyset) = 0. \tag{11.15}$$
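The link between a basic belief assignment and the credibility and plausibility functions can be sketched in Python. The sketch relies on the standard relations $\mathrm{Cr}(A) = \sum_{\emptyset \ne B \subseteq A} m(B)$ and $\mathrm{Pl}(A) = \sum_{B \cap A \ne \emptyset} m(B)$, which are not written out in the text above and are stated here as an assumption; the frame, the masses and the function names are hypothetical.

```python
from itertools import combinations


def powerset(frame):
    """All subsets of the frame, as frozensets."""
    items = sorted(frame)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]


def credibility(m, A):
    """Cr(A): total mass of the non-empty focal sets included in A."""
    return sum(mass for B, mass in m.items() if B and B <= A)


def plausibility(m, A):
    """Pl(A): total mass of the focal sets that intersect A."""
    return sum(mass for B, mass in m.items() if B & A)


# Hypothetical bba on the frame {a, b, c}; it satisfies (11.14) and (11.15).
frame = frozenset("abc")
m = {frozenset("a"): 0.5, frozenset("ab"): 0.3, frozenset("abc"): 0.2}

subsets = powerset(frame)

# Check the superadditivity condition (11.13) on every pair of subsets.
superadditive = all(
    credibility(m, A | B)
    >= credibility(m, A) + credibility(m, B) - credibility(m, A & B) - 1e-12
    for A in subsets
    for B in subsets
)

print("Cr({a,b}) =", credibility(m, frozenset("ab")))
print("Pl({b})   =", plausibility(m, frozenset("b")))
print("(11.13) holds on all pairs:", superadditive)
```

On this example the duality $\mathrm{Pl}(A) = 1 - \mathrm{Cr}(\bar{A})$ can also be verified subset by subset.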


The evidence theory is often described as a generalization of probabilistic methods to the treatment of uncertainty, as it can handle events which are not necessarily exclusive.

Hence the advantage of being able to represent explicitly the uncertainty arising from imprecise knowledge. Human beings easily handle imprecise knowledge. For example, a person does not give his age to the nearest day, or his height to the nearest inch, even if he has access to sufficient information. A mathematical formulation of imprecision came from Lotfi Zadeh through fuzzy set theory [15]. The modelling of uncertainties due to the imprecision of knowledge gives rise to possibility theory, which constitutes, together with fuzzy set theory, the general framework of fuzzy logic.


11.1.3 Fuzzy logic 


Fuzzy logic appeared in 1965 with Lotfi Zadeh's work. Its development was mainly motivated by the need for a conceptual framework that can address the issue of uncertainty and lexical imprecision. From this work, one should retain the need to formalize the representation and the processing of imprecise or approximate knowledge, with the intention of treating highly complex systems in which human factors are often present. Thus, fuzzy logic intervenes to deal with imperfect knowledge.

Fuzzy logic is based on two main subject matters [9]: fuzzy set theory and the modelling of approximate reasoning in the framework of possibility theory.


The definition of a fuzzy subset answers the need to represent imprecise knowledge. The concept was introduced to avoid abrupt changes from one class to another (from black to white, for example) and to allow elements that belong completely neither to one class nor to the other (to be gray in the example). In a reference set $\Theta$, a fuzzy subset $A$ of $\Theta$ is characterized by a membership function $\mu_A$, defined as:

$$\mu_A : \Theta \to [0,1]$$

which is the extension of the classical membership function $\chi_A$, the indicator function of the set $A$, that is:

$$\chi_A : \Theta \to \{0,1\}.$$

To emphasize the difference with the ordinary sets of $\Theta$, we use lower case letters for the fuzzy sets of $\Theta$.


Definition 6. Let $a$ be a fuzzy set of $\Theta$ and let $\alpha$ be a real value in [0,1]. The $\alpha$-cut $a_\alpha$ is the subset of $\Theta$ defined by:
$$a_\alpha = \{\theta \in \Theta;\ \mu_a(\theta) \ge \alpha\}. \tag{11.16}$$
Then $\forall \alpha, \beta \in [0,1]$,
$$\alpha \le \beta \implies a_\beta \subseteq a_\alpha$$
and $\forall \theta \in \Theta$,
$$\mu_a(\theta) = \sup\{\alpha \in [0,1];\ \theta \in a_\alpha\}. \tag{11.17}$$
This allows the passage from fuzzy sets to ordinary sets and immediately gives the fuzzy versions of the usual operations used for ordinary sets.


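Property 2. Let $a$ and $b$ be two fuzzy sets of $\Theta$ defined by their membership functions $\mu_a$ and $\mu_b$; one has:

• equality: $a = b \iff \forall \theta \in \Theta,\ \mu_a(\theta) = \mu_b(\theta)$

• inclusion: $a \subseteq b \iff \forall \theta \in \Theta,\ \mu_a(\theta) \le \mu_b(\theta)$

• union: $a \cup b \iff \forall \theta \in \Theta,\ \mu_{a \cup b}(\theta) = \max(\mu_a(\theta), \mu_b(\theta))$

• intersection: $a \cap b \iff \forall \theta \in \Theta,\ \mu_{a \cap b}(\theta) = \min(\mu_a(\theta), \mu_b(\theta))$

• complement: $\bar{a} \iff \forall \theta \in \Theta,\ \mu_{\bar{a}}(\theta) = 1 - \mu_a(\theta)$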
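The α-cut of Definition 6 and the operations of Property 2 can be sketched in Python on a finite frame. This is an illustrative sketch only: fuzzy sets are represented as dictionaries from elements to membership degrees, and the element names, degrees and function names are hypothetical.

```python
def alpha_cut(mu, alpha):
    """Ordinary set of elements whose membership degree is at least alpha (11.16)."""
    return {theta for theta, degree in mu.items() if degree >= alpha}


def fuzzy_union(mu_a, mu_b):
    return {t: max(mu_a[t], mu_b[t]) for t in mu_a}


def fuzzy_intersection(mu_a, mu_b):
    return {t: min(mu_a[t], mu_b[t]) for t in mu_a}


def fuzzy_complement(mu_a):
    return {t: 1.0 - degree for t, degree in mu_a.items()}


def membership_from_cuts(mu, theta, levels):
    """Recover the membership degree as the largest level whose cut contains theta (11.17)."""
    return max((alpha for alpha in levels if theta in alpha_cut(mu, alpha)),
               default=0.0)


# Two hypothetical fuzzy sets a and b on the frame {t1, t2, t3}.
mu_a = {"t1": 0.2, "t2": 0.7, "t3": 1.0}
mu_b = {"t1": 0.5, "t2": 0.4, "t3": 0.0}

print("0.5-cut of a:", sorted(alpha_cut(mu_a, 0.5)))
print("a union b   :", fuzzy_union(mu_a, mu_b))
print("a inter b   :", fuzzy_intersection(mu_a, mu_b))
print("complement a:", fuzzy_complement(mu_a))

levels = [i / 10 for i in range(11)]
print("degree of t2 recovered from cuts:", membership_from_cuts(mu_a, "t2", levels))
```

Restricting the supremum in (11.17) to a finite grid of levels is a simplification; it recovers the exact degree here only because the degrees lie on that grid.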
Uncertainty about the truth of a statement is not addressed by fuzzy set theory.

Possibility theory was introduced in 1978 by Lotfi Zadeh in order to handle non-probabilistic uncertainties for which probability theory does not give any satisfactory solution. Possibility theory provides a framework in which imprecise knowledge and uncertain knowledge can coexist and can be treated jointly.

Possibility theory provides a method to formalize subjective uncertainties on events. It tells us to what extent the realization of an event is possible and to what extent we are sure of it, without having any evaluation of probabilities at our disposal. One presents possibility theory in a general form that introduces the concepts of possibility measure and necessity measure.

Consider either the frame $\Omega$ (experiment space) or $\Theta$ (space of hypotheses). Let $\mathcal{A}$ be a family of subsets of $\Omega$ or of subsets of $\Theta$. When $\Omega$ or $\Theta$ is finite, $\mathcal{A}$ is the set of all subsets.


Definition 7. A possibility measure Pos is an application of $\mathcal{A} \subset \mathcal{P}(\Theta)$ in [0,1] such that:

i) $\mathrm{Pos}(\emptyset) = 0$, $\mathrm{Pos}(\Theta) = 1$.

ii) for any finite family $\{A_i, i \in I\}$ of events, one has:
$$\mathrm{Pos}\Big(\bigcup_{i \in I} A_i\Big) = \sup_{i \in I}\, \mathrm{Pos}(A_i). \tag{11.18}$$

According to Zadeh, this is the most pessimistic notion or the most prudent notion for a belief. One has in particular:
$$\max(\mathrm{Pos}(A), \mathrm{Pos}(\bar{A})) = 1 \tag{11.19}$$
and then:
$$\mathrm{Pos}(A) + \mathrm{Pos}(\bar{A}) \ge 1. \tag{11.20}$$

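Properties (11.18) to (11.20) can be checked numerically on a small finite frame. The sketch below assumes that the possibility measure is induced by a possibility distribution $\pi$ on the elements, with $\mathrm{Pos}(A) = \max_{\theta \in A} \pi(\theta)$ and $\max_\theta \pi(\theta) = 1$; this construction, the distribution values and the function name are assumptions made for the illustration and are not taken from the text.

```python
def possibility(pi, A):
    """Pos(A) = max of the possibility distribution over A (0 for the empty set)."""
    return max((pi[theta] for theta in A), default=0.0)


# Hypothetical normalized possibility distribution on the frame {t1, t2, t3}.
pi = {"t1": 1.0, "t2": 0.6, "t3": 0.1}
frame = set(pi)

A = {"t2", "t3"}
B = {"t1", "t3"}
not_A = frame - A

print("Pos(A)      =", possibility(pi, A))
print("Pos(not A)  =", possibility(pi, not_A))
print("Pos(A u B)  =", possibility(pi, A | B))
print("max of both =", max(possibility(pi, A), possibility(pi, B)))
```

With these values, $\mathrm{Pos}(A) = 0.6$ and $\mathrm{Pos}(\bar{A}) = 1$, so that the maximum in (11.19) equals 1 and the sum in (11.20) equals 1.6.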
11.1.4 Confidence measures 


Definition: A confidence measure $c$ is an application of $\mathcal{P}(\Theta)$, the set of parts of $\Theta$, in [0,1] which satisfies the following properties:

i) $c(\emptyset) = 0$ and $c(\Theta) = 1$

ii) (monotony) $\forall A, B \in \mathcal{P}(\Theta)$, $A \subset B \implies c(A) \le c(B)$

iii) (continuity) For all increasing or decreasing sequences $(A_n)_{n \in \mathbb{N}}$ of elements of $\mathcal{P}(\Theta)$, one has:
$$\lim c(A_n) = c(\lim A_n).$$

Consequently, one has: $\forall A, B \in \mathcal{P}(\Theta)$,
$$c(A \cap B) \le \min(c(A), c(B)) \quad \text{and} \quad \max(c(A), c(B)) \le c(A \cup B).$$

Probabilities, fuzzy sets and possibility measures are special cases of the general notion of confidence measures.


11.2 Fusions 


As in physics, information fusion modelling aims at giving the best possible description of the experimental reality. Let us give the postulates that information fusions need to satisfy.

11.2.1 Postulates 

1. Coherence or noncontradiction 

2. Continuity of the method 

3. Universality or completeness 


4. No information refusal 


A first consequence is that postulates 2 and 3 lead to the use of real numbers to represent and compare degrees of confidence. Postulate 4, however, leads to hypothetical conditioning: the confidence degree is only known conditionally upon the environment, the context.

The confidence granted to event $A \in \mathcal{P}(\Theta)$ in the environment $e$ is denoted $[A/e]$.


From Edwin Thompson Jaynes [10]: Obviously, the operation of real human brains is so complicated that we can make no pretense of explaining its mysteries; and in any event we are not trying to explain, much less reproduce, all the aberrations and inconsistencies of human brains. To emphasize this, instead of asking, "How can we build a mathematical model of human common sense?" let us ask, "How could we build a machine which would carry out useful plausible reasoning, following clearly defined principles expressing an idealized common sense?"


11.2.2 Machine on confidence 


We develop the approach essentially based on Cox's work [5], later detailed by Tribus [14], while being criticized.

$$i = \text{impossible} = 0 \le [A/e] \le c = \text{certain} = 1$$

The various possible relations are listed by setting $u \triangleq [A \wedge B/e]$, which expresses the confidence provided by the fusion of $A$ and $B$ within the environment $e$. Let us define:
$$x \triangleq [A/e], \quad v \triangleq [A/Be], \quad y \triangleq [B/e], \quad w \triangleq [B/Ae].$$

Eleven functional relations are possible: $u = F_1(x,v)$, $u = F_2(x,y)$, $u = F_3(x,w)$, $u = F_4(v,y)$, $u = F_5(v,w)$, $u = F_6(y,w)$, $u = F_7(x,v,y)$, $u = F_8(x,v,w)$, $u = F_9(x,y,w)$, $u = F_{10}(v,y,w)$ and $u = F_{11}(x,v,y,w)$.


Because of the postulates, the functions $F_5$, $F_8$, $F_{10}$ and $F_{11}$ have to be discarded. The symmetries induce simplifications. The functional relations able to meet the requirements are:


$$u = F_2(x,y) = F_2(y,x)$$
$$u = F_3(x,w) = F_4(v,y)$$
$$u = F_7(x,v,y) = F_9(x,y,w)$$

The associativity condition on the fusion confidence
$$[A \wedge B \wedge C/e] = [A \wedge (B \wedge C)/e] = [(A \wedge B) \wedge C/e]$$
discards $F_7$.

On the other hand, $F_3$ and $F_2$ satisfy the same associativity equation. By calling $\circ$ the common operation describing all the possible fusions between the confidences, this unique equation covers two different situations:

• First case: $u = F_3(x,w) = F_4(v,y)$
$$[A \wedge B/e] = [A/e] \circ [B/Ae] = [B/e] \circ [A/Be]$$
• Second case: $u = F_2(x,y) = F_2(y,x)$
$$[A \wedge B/e] = [A/e] \circ [B/e].$$

This second case was not considered by Cox; its consequences constitute the first results of this chapter.


11.2.3 Operator 


• First case:
$$[B/Ae] < [B'/Ae] \implies [A \wedge B/e] < [A \wedge B'/e].$$

The first case implies strict inequalities on the second variable. The mathematician Aczél [1] has given the proof based on the strict monotony of one of the two variables. The general solution of the functional equation is such that:
$$w([A \wedge B/e]) = w([A/e])\, w([B/Ae]) = w([B/e])\, w([A/Be]) \tag{11.21}$$
where $w$ is a continuous strictly monotone function of [0,1] onto [0,1]. Thus,
$$[A \wedge B/e] = w^{-1}\big(w([A/e])\, w([B/Ae])\big) = [A/e] \circ [B/Ae]$$

The fusion operation $\circ$ is described by a simple product of real numbers after transformation. This preliminary transformation of the confidence $c(A) = [A/e]$ in $A$ within the environment $e$ is made by using a continuous and strictly monotone function $w$. This result is easily understood by comparing the transformation $w$ with the Fourier transformation. The latter transforms the convolution product of two functions into the product of their Fourier transforms.

The first case with additional properties gives probability theory. The problem is to know whether there is a similar property in the second case.

• Second case: The strict monotony is not obvious.

If $[A/e] \le [A'/e]$ and $[B/e] \le [B'/e]$ then $[A \wedge B/e] \le [A' \wedge B'/e]$. On the other hand, one has the commutativity property, and $\circ$ has all the characteristics of a triangular norm, a common notion in data processing [9]. In this second case, the confidence fusions are associated to the t-norms. The second case implies the fuzzy theory.


11.3 T-norm


Definition: A triangular norm, called t-norm, is a function $\circ : [0,1] \times [0,1] \to [0,1]$ that satisfies the following conditions for all $x, y, z, t$ in [0,1]:

i) (commutativity) $x \circ y = y \circ x$

ii) (associativity) $(x \circ y) \circ z = x \circ (y \circ z)$

iii) (isotony) if $x \le z$ and $y \le t$, then $(x \circ y) \le (z \circ t)$

iv) (neutral element 1) $(x \circ 1) = x$

Example 1. The operator $\circ = \min$ is a t-norm; this is the upper t-norm. For any t-norm $\circ$ and all $x, y$ in [0,1],
$$(x \circ y) \le \min(x, y)$$
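The four conditions of the definition can be tested numerically on a grid of values. The sketch below is only a sanity check on finitely many points, not a proof; the function name `is_t_norm_on_grid`, the grid and the tolerance are hypothetical choices.

```python
from itertools import product

GRID = [i / 10 for i in range(11)]  # 0.0, 0.1, ..., 1.0


def is_t_norm_on_grid(op, grid=GRID, tol=1e-12):
    """Check conditions i) to iv) of the definition on every grid point."""
    for x, y in product(grid, repeat=2):
        if abs(op(x, y) - op(y, x)) > tol:            # i) commutativity
            return False
    for x in grid:
        if abs(op(x, 1.0) - x) > tol:                 # iv) neutral element 1
            return False
    for x, y, z in product(grid, repeat=3):
        if abs(op(op(x, y), z) - op(x, op(y, z))) > tol:  # ii) associativity
            return False
    for x, y, z, t in product(grid, repeat=4):
        if x <= z and y <= t and op(x, y) > op(z, t) + tol:  # iii) isotony
            return False
    return True


def product_op(x, y):
    return x * y


print("min is a t-norm on the grid:    ", is_t_norm_on_grid(min))
print("product is a t-norm on the grid:", is_t_norm_on_grid(product_op))
print("max is a t-norm on the grid:    ", is_t_norm_on_grid(max))

# Example 1: the product never exceeds min on the grid.
bounded_by_min = all(product_op(x, y) <= min(x, y) + 1e-12
                     for x, y in product(GRID, repeat=2))
print("product <= min on the grid:     ", bounded_by_min)
```

The operator max fails the check because 1 is not its neutral element, which illustrates that the conditions are not satisfied by every symmetric monotone operation.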
Lemma 1. If the associated t-norm is strictly increasing, the operator on the confidences is written as follows: $w([A \wedge B/e]) = w([A/e])\, w([B/e])$, where $w$ is a continuous and strictly increasing bijection of [0,1] onto [0,1].

According to the additional hypothesis, we retrieve: $[A \wedge B/e] = w^{-1}\big(w([A/e])\, w([B/e])\big)$.
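The form $x \circ y = w^{-1}(w(x)\,w(y))$ given by Lemma 1 can be sketched as a small Python function taking the transformation $w$ and its inverse as arguments. The two transformations used below are illustrative assumptions: the identity, which gives back the ordinary product, and $w(x) = \exp(-(1-x)/x)$, chosen here only as one example of a continuous, strictly increasing bijection of [0,1] onto [0,1].

```python
import math


def fuse(x, y, w, w_inv):
    """Fusion of two confidences: a plain product after the transformation w."""
    return w_inv(w(x) * w(y))


def identity(x):
    return x


def w_exp(x):
    """Hypothetical transformation w(x) = exp(-(1 - x) / x), with w(0) = 0."""
    if x == 0.0:
        return 0.0
    return math.exp(-(1.0 - x) / x)


def w_exp_inv(z):
    """Inverse of w_exp: solves exp(-(1 - x) / x) = z for x."""
    if z == 0.0:
        return 0.0
    return 1.0 / (1.0 - math.log(z))


print("identity w:", fuse(0.3, 0.4, identity, identity))  # ordinary product
print("w_exp     :", fuse(0.3, 0.4, w_exp, w_exp_inv))
print("neutral 1 :", fuse(0.3, 1.0, w_exp, w_exp_inv))
```

With `w_exp`, the fusion of $x$ and $y$ works out to $xy/(x + y - xy)$, so the same construction yields operators other than the product while keeping commutativity and associativity.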


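Theorem 1. The fuzzy operator $[A \wedge B/e] = [A/e] \wedge [B/e] = \inf\{[A/e], [B/e]\}$ is the limit of a sequence of strictly monotone operators $\circ_n$.

Proof: Let $(T_n)_{n > 0}$ be the family of strictly monotone t-norms such that:
$$\forall n \ge 1, \quad T_n(x,y) = \frac{1}{1 + \sqrt[n]{\left(\frac{1-x}{x}\right)^n + \left(\frac{1-y}{y}\right)^n}} = w_n^{-1}\big(w_n(x)\, w_n(y)\big) \quad \text{with} \quad w_n(x) = \exp\left(-\left(\frac{1-x}{x}\right)^n\right).$$

For all $n \ge 1$, $w_n$ is a continuous and strictly increasing bijection of [0,1] onto [0,1]. We have for all $x, y$:
$$\lim_{n \to \infty} T_n(x,y) = \frac{1}{1 + \max\left(\frac{1-x}{x}, \frac{1-y}{y}\right)}$$
In fact, if $0 < a \le b$,
$$\lim_{n \to \infty} \sqrt[n]{a^n + b^n} = \lim_{n \to \infty} b\left(1 + \left(\frac{a}{b}\right)^n\right)^{1/n} = b$$
therefore
$$\lim_{n \to \infty} T_n(x,y) = f^{-1}\big(\max(f(x), f(y))\big)$$
where $f(x) = \frac{1-x}{x}$. Since $f$ is strictly decreasing on [0,1],
$$\max(f(x), f(y)) = f(\min(x, y))$$
and it follows that
$$\lim_{n \to \infty} T_n(x,y) = \min(x, y). \qquad \blacksquare$$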
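The convergence stated in Theorem 1 can be observed numerically. The following Python sketch evaluates $T_n(x,y)$ from its closed form; the function name `t_n` is hypothetical, and the n-th root is computed by factoring out the larger term, as in the step $b\,(1 + (a/b)^n)^{1/n}$ of the proof, which also avoids numerical overflow for large $n$.

```python
def t_n(x, y, n):
    """T_n(x, y) = 1 / (1 + (f(x)^n + f(y)^n)^(1/n)) with f(x) = (1 - x) / x."""
    if x == 0.0 or y == 0.0:
        return 0.0                      # f is infinite at 0, so T_n vanishes
    a = (1.0 - x) / x
    b = (1.0 - y) / y
    big, small = max(a, b), min(a, b)
    if big == 0.0:                      # x = y = 1
        return 1.0
    root = big * (1.0 + (small / big) ** n) ** (1.0 / n)
    return 1.0 / (1.0 + root)


for n in (1, 3, 10, 100):
    print(f"n = {n:3d}: T_n(0.4, 0.7) = {t_n(0.4, 0.7, n):.4f}")
print("min(0.4, 0.7) =", min(0.4, 0.7))
```

For $n = 3$ the function gives about 0.0810 at $x = y = 0.1$ and about 0.8772 at $x = y = 0.9$, and as $n$ grows the values approach $\min(x, y)$ from below.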


Here are the results obtained for several fusion operators. Along the x-axis, x increases in steps of 0.1 (columns, from 0.1 to 0.9), and along the y-axis, y likewise increases in steps of 0.1 (rows, from 0.1 to 1.0).

• Result obtained with the product operator: $x \circ y = x\,y$

  y \ x    0.1     0.2     0.3     0.4     0.5     0.6     0.7     0.8     0.9
  0.1    0.0100  0.0200  0.0300  0.0400  0.0500  0.0600  0.0700  0.0800  0.0900
  0.2    0.0200  0.0400  0.0600  0.0800  0.1000  0.1200  0.1400  0.1600  0.1800
  0.3    0.0300  0.0600  0.0900  0.1200  0.1500  0.1800  0.2100  0.2400  0.2700
  0.4    0.0400  0.0800  0.1200  0.1600  0.2000  0.2400  0.2800  0.3200  0.3600
  0.5    0.0500  0.1000  0.1500  0.2000  0.2500  0.3000  0.3500  0.4000  0.4500
  0.6    0.0600  0.1200  0.1800  0.2400  0.3000  0.3600  0.4200  0.4800  0.5400
  0.7    0.0700  0.1400  0.2100  0.2800  0.3500  0.4200  0.4900  0.5600  0.6300
  0.8    0.0800  0.1600  0.2400  0.3200  0.4000  0.4800  0.5600  0.6400  0.7200
  0.9    0.0900  0.1800  0.2700  0.3600  0.4500  0.5400  0.6300  0.7200  0.8100
  1.0    0.1000  0.2000  0.3000  0.4000  0.5000  0.6000  0.7000  0.8000  0.9000

• Result obtained with the operator $\circ_n$ for $n = 3$:

  y \ x    0.1     0.2     0.3     0.4     0.5     0.6     0.7     0.8     0.9
  0.1    0.0810  0.0975  0.0995  0.0999  0.1000  0.1000  0.1000  0.1000  0.1000
  0.2    0.0975  0.1656  0.1905  0.1973  0.1992  0.1998  0.1999  0.2000  0.2000
  0.3    0.0995  0.1905  0.2538  0.2838  0.2947  0.2984  0.2996  0.2999  0.3000
  0.4    0.0999  0.1973  0.2838  0.3460  0.3794  0.3933  0.3982  0.3996  0.4000
  0.5    0.1000  0.1992  0.2947  0.3794  0.4425  0.4784  0.4937  0.4987  0.4999
  0.6    0.1000  0.1998  0.2984  0.3933  0.4784  0.5435  0.5810  0.5959  0.5996
  0.7    0.1000  0.1999  0.2996  0.3982  0.4937  0.5810  0.6494  0.6872  0.6988
  0.8    0.1000  0.2000  0.2999  0.3996  0.4987  0.5959  0.6872  0.7605  0.7955
  0.9    0.1000  0.2000  0.3000  0.4000  0.4999  0.5996  0.6988  0.7955  0.8772
  1.0    0.1000  0.2000  0.3000  0.4000  0.5000  0.6000  0.7000  0.8000  0.9000

As soon as n = 3 we observe how closely this operator approximates $x \circ y = \min(x, y)$.

• Result obtained with the fusion operator: $x \circ y = \min(x, y)$

  y \ x    0.1     0.2     0.3     0.4     0.5     0.6     0.7     0.8     0.9
  0.1    0.1000  0.1000  0.1000  0.1000  0.1000  0.1000  0.1000  0.1000  0.1000
  0.2    0.1000  0.2000  0.2000  0.2000  0.2000  0.2000  0.2000  0.2000  0.2000
  0.3    0.1000  0.2000  0.3000  0.3000  0.3000  0.3000  0.3000  0.3000  0.3000
  0.4    0.1000  0.2000  0.3000  0.4000  0.4000  0.4000  0.4000  0.4000  0.4000
  0.5    0.1000  0.2000  0.3000  0.4000  0.5000  0.5000  0.5000  0.5000  0.5000
  0.6    0.1000  0.2000  0.3000  0.4000  0.5000  0.6000  0.6000  0.6000  0.6000
  0.7    0.1000  0.2000  0.3000  0.4000  0.5000  0.6000  0.7000  0.7000  0.7000
  0.8    0.1000  0.2000  0.3000  0.4000  0.5000  0.6000  0.7000  0.8000  0.8000
  0.9    0.1000  0.2000  0.3000  0.4000  0.5000  0.6000  0.7000  0.8000  0.9000
  1.0    0.1000  0.2000  0.3000  0.4000  0.5000  0.6000  0.7000  0.8000  0.9000
It was not obvious how to obtain the functions $w_n$. The fuzzy operator $\circ = \min$ arises as the limit of fusions $\circ_n$, each of which admits, after the confidence transformations $w_n([A/e])$ and $w_n([B/e])$, a decomposition as a conventional product of real numbers.
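The convergence stated in Theorem 1 can be checked numerically. The following is only a minimal sketch, assuming the family $T_n(x,y) = 1/(1 + ((\frac{1-x}{x})^n + (\frac{1-y}{y})^n)^{1/n})$ with $w_n(x) = \exp(-(\frac{1-x}{x})^n)$; the function names `w_n` and `t_n` are ours and purely illustrative.

```python
import math


def w_n(x: float, n: int) -> float:
    """Confidence transformation w_n(x) = exp(-((1 - x) / x) ** n), with w_n(0) = 0."""
    if x == 0.0:
        return 0.0
    return math.exp(-(((1.0 - x) / x) ** n))


def t_n(x: float, y: float, n: int) -> float:
    """Strictly monotone t-norm T_n(x, y) = 1 / (1 + (f(x)^n + f(y)^n)^(1/n)), f(x) = (1 - x) / x."""
    if x == 0.0 or y == 0.0:
        return 0.0
    fx = (1.0 - x) / x
    fy = (1.0 - y) / y
    return 1.0 / (1.0 + (fx ** n + fy ** n) ** (1.0 / n))


# The gap to min(x, y) shrinks as n grows.
for n in (1, 3, 10, 50):
    print(n, round(t_n(0.3, 0.7, n), 4))
```

After the transformation $w_n$, the fusion is an ordinary product: `w_n(t_n(x, y, n), n)` agrees with `w_n(x, n) * w_n(y, n)` up to rounding, which is the decomposition mentioned above.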


11.3.1 Independence-interdependence 


The second functional relation

$$w([A \wedge B/e]) = w([A/e])\, w([B/e])$$

is discarded if we consider that there is a link between the knowledge of two facts in a given environment. This constraint, admitted by Cox and then by Tribus, is however not valid for all uncertainty models. Let us give two examples for which the argument given by Tribus is insufficient. In probability theory, drawing balls at random with or without replacement leads to two different models. The testimony of different persons is another example: the testimonies can be obtained separately or in a meeting.

Thus, because of the conditions under which the knowledge is acquired, the postulates lead to two distinct theories: probability theory and fuzzy logic.


In addition, from the four basic postulates explained above and valid for the three theories (probability theory, evidence theory and fuzzy logic), by adding the hypothesis of interdependence and admitting a postulate of precision leading to the additive rule, one would obtain the probabilities as well as the transition probabilities and therefore the credibilities.


11.3.2 T-norm description 


We have also obtained a result characterizing the t-norms by correcting and extending a previous demonstration [II]. This is our third result.

Theorem 2. Let $\circ$ be a continuous t-norm of $[0,1] \times [0,1] \to [0,1]$. Then the interval [0, 1] is the union

1. of closed intervals $[b,c]$ over which the equality $s \circ s = s$ is satisfied and

2. of open intervals $(a,b)$ for which $a \circ a = a$ and $b \circ b = b$ and for which the inequality $s \circ s \neq s$ is satisfied.

For the intervals $[b,c]$ of the first kind: $\forall x \in [b,c],\ \forall y \in [x,1],\ x \circ y = x \wedge y$.

For each interval $(a,b)$ of the second kind there exists a function $w$, strictly increasing from $[a,b]$ into $[0,1]$, such that $w(b) = 1$.

If $\forall s \in (a,b),\ s \circ s \neq a$, then $w(a) = 0$ and $\forall x, y \in [a,b],\ x \circ y = w^{-1}(w(x)\, w(y))$.

If $\exists s \in (a,b),\ s \circ s = a$, then $w(a) > 0$ and $\forall x, y \in [a,b],\ x \circ y = w^{-1}(w(x)\, w(y)) \vee a$.
On each such interval $(a,b)$, the operation $\circ$ can be constant when $x$ varies from $a$. However, the interval over which the function is actually constant depends on the value of the second variable $y$. The separation curve $\{(x,y) \in [a,b] \times [a,b];\ x \circ y = a\}$ in the space $[a,b] \times [a,b]$ is given by the equality $w(x \circ y) = w(a) = w(x)\, w(y)$.
This theorem results from the lemmas hereafter.
Lemma 2. The set $\{x \in [0,1];\ T(x,x) = x\}$ is a union of closed intervals of the interval [0, 1].

Any adherence point $s$ of a sequence $(s_n;\ n \in \mathbb{N},\ T(s_n, s_n) = s_n)$ satisfies $T(s,s) = s$ by the continuity of $T$, and therefore $s$ belongs to the closed interval. Thus, for example, the set
{[0] : +, =| ¿n € N} constitutes an infinite family of closed intervals. On each of the open intervals 
of the countable infinity of the complementary set, it is sufficient to define a t-norm by means of a 
continuous and increasing function w. Each of these functions w depends on the open interval under 


consideration. 


Lemma 3. If $\alpha$ exists in the open interval (0,1) such that $T(\alpha,\alpha) \neq \alpha$, then there are two real values $a, b$ satisfying the inequalities $0 \leq a < \alpha < b \leq 1$ as well as the equalities $T(a,a) = a$ and $T(b,b) = b$. Furthermore, for all real values $s$ in the open interval $(a,b)$, the inequality $T(s,s) \neq s$ is satisfied.


Lemma 4. Let $T$ be a continuous t-norm. For every pair $(x,y)$ of [0,1] such that there exists $\alpha$, $x \leq \alpha \leq y$, with $T(\alpha, \alpha) = \alpha$, we have:

$$T(x, y) = x = \min(x, y).$$

Any continuous t-norm $T$ coincides over $[0,1] \times [0,1]$ with the min function, except for the points $(x,y)$, $x < y$, for which one cannot find a real $\alpha$ such that:

$$x \leq \alpha \leq y \quad \text{and} \quad T(\alpha,\alpha) = \alpha.$$

One has to study the behavior of $T$ in the regions $[a,b] \times [a,b]$ of the intervals $[a,b]$ of the second kind.


Lemma 5. Consider the associative and commutative operation $\circ$ of $[a,b] \times [a,b] \to [a,b]$ which is continuous and non-decreasing with respect to both variables and such that $a \circ a = a$ and $b \circ b = b$, but such that for all $s$ in the open interval $(a,b)$ one has the inequality $s \circ s \neq s$. Let $u$ in the closed interval $[a,b]$ be the upper bound of the $v$ such that $v \circ v = a$, that is, $u = \sup\{v \in [a,b];\ v \circ v = a\}$. The operation $\circ$ is strictly increasing in each of the two variables wherever $x \circ y \neq a$, and if $u = a$ then $\circ$ is strictly increasing over $[a,b] \times [a,b]$.


Lemma 6. Under the conditions of application of Lemma 5, if $u = a$, then for all $\alpha$ in $(a,b)$ and for every nonzero positive rational number $q$, the real power $\alpha^{\circ q}$ is defined and is a real number in $(a,b)$.
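Remark 1. It can easily be verified that, for positive rational exponents $\frac{r}{n}$ and $\frac{s}{m}$, the powers add up:

$$\alpha^{\circ \frac{r}{n}} \circ \alpha^{\circ \frac{s}{m}} = \alpha^{\circ \frac{rm}{nm}} \circ \alpha^{\circ \frac{sn}{nm}} = \alpha^{\circ \frac{rm+sn}{nm}} = \alpha^{\circ \left(\frac{r}{n} + \frac{s}{m}\right)}$$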


Lemma 7. Under the conditions of application of Lemma 5, if $u = a$, the application $q \in \mathbb{Q}_+^* \mapsto \alpha^{\circ q} \in (a,b)$ is strictly decreasing and satisfies $\lim_{q \to 0} \alpha^{\circ q} = b$ and $\lim_{q \to \infty} \alpha^{\circ q} = a$.

Lemma 8. The application $r \in [0,\infty) \mapsto \alpha^{\circ r} = \sup\{\alpha^{\circ q};\ r < q\}$ is continuous and decreasing, and $\alpha^{\circ r} = \inf\{\alpha^{\circ q};\ q < r\}$.

Lemma 9. Under the conditions of application of Lemma 5, if $u > a$, one defines the application $r \in [0,\infty) \mapsto u^{\circ r}$ in $[a,b]$ as previously, with $u^{\circ r}$ strictly decreasing over [0,2] and such that $u^{\circ 0} = b$, $u^{\circ 2} = a$, and for all $r > 2$, $u^{\circ r} = a$.

Lemma 10. Under the conditions of application of Lemma 5, if $u > a$, one defines for all $\alpha \in (a,b)$ the application $r \in [0,\infty) \mapsto \alpha^{\circ r}$ in $[a,b]$. In this case, there is a positive real number $r_0$ such that $\alpha^{\circ r}$ is strictly decreasing over $[0, r_0]$, with $\alpha^{\circ 0} = b$, $\alpha^{\circ r_0} = a$, and for all $r > r_0$, $\alpha^{\circ r} = a$.


Lemma 11. Consider the associative and commutative operation $\circ$ of $[a,b] \times [a,b] \to [a,b]$, continuous and strictly increasing with respect to both variables, such that $a \circ a = a$ and $b \circ b = b$ but with the inequality $s \circ s \neq s$ for all $s$ in the open interval $(a,b)$. Then there is a continuous and strictly increasing function $w$ such that:

$$x \circ y = w^{-1}(w(x)\, w(y)) \vee a = \max(a, w^{-1}(w(x)\, w(y))) \tag{11.22}$$

The results of the lemmas complete the justification of Theorem 2.
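The representation (11.22) can be illustrated on the single interval $[a,b] = [0,1]$. The sketch below is only an example under our own assumptions: the two generators chosen here, $w(x) = x$ (for which $w(a) = 0$) and $w(x) = e^{x-1}$ (for which $w(a) = e^{-1} > 0$), are not taken from the text. They give back the ordinary product and the bounded difference $\max(0, x + y - 1)$ respectively, and the helper name `make_tnorm` is hypothetical.

```python
import math
from typing import Callable


def make_tnorm(w: Callable[[float], float],
               w_inv: Callable[[float], float],
               a: float = 0.0) -> Callable[[float, float], float]:
    """Build x o y = max(a, w_inv(w(x) * w(y))) on [a, b] from a strictly increasing w with w(b) = 1."""
    def op(x: float, y: float) -> float:
        return max(a, w_inv(w(x) * w(y)))
    return op


# Case w(a) = 0: s o s never falls back to a inside (a, b). Here w is the identity.
product = make_tnorm(lambda x: x, lambda t: t)

# Case w(a) > 0: the operation is constant (equal to a) on part of the square.
bounded_difference = make_tnorm(lambda x: math.exp(x - 1.0),
                                lambda t: 1.0 + math.log(t))

print(product(0.5, 0.4))             # ordinary product
print(bounded_difference(0.7, 0.6))  # x + y - 1 when this is positive
print(bounded_difference(0.3, 0.4))  # clipped to a = 0
```

In the second case the separation curve $w(x)\,w(y) = w(a)$ is the line $x + y = 1$, below which the operation stays equal to $a = 0$.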


11.4 Conclusions 


Finally, the same postulates applied to confidences, in different environments (either in a dependence or in an independence situation), imply the same foundation for the various modern theories of information fusion in the framework of uncertainty, by using deductions that we have unified. The independence between elementary confidences does not need to be understood in the probabilistic sense. The formula P(A/e) = P(A) for the probability of A in the environment e is meaningless. One has to find another conceptualization of the notion of independence, moving away from the probabilistic concept.

We must make new models when fusion analysis is to be applied in all situations. We take the simple example of logical implication

$$P \text{ and } Q \Rightarrow R$$
Every logical proposition P, Q, R takes only one of the two numerical values 0, 1. Yet with probability these propositions are able to take any numerical value in the interval [0,1] to represent the statistical limit of existence when the experiment is repeated as often as possible. Nowadays, the numbers [P/e], [Q/e] and [R/e] only give the intuitive beliefs when the conditions e on the surroundings are well defined.

To be more explicit, let us take a plausible medical situation. Many patients present chaotic neurological disorders. Does the deterministic chaos P with the drug Q result in the end of illness R?

We have no reason in such a medical situation to introduce the limitation of logical implication. Moreover, we have the fusion "and" of the two beliefs [P/e] on the disorder P and [Q/e] on the efficiency of drug Q, and we expect this fusion to give precisely the belief [R/e] of the recovery R from the two beliefs [P/e] and [Q/e].


In addition, let us take Zadeh's example, discussed in Chapter [5], in order to make a new analysis with our fusion principles. One has the values

$$m(M) = 0 \qquad m(C) = 0 \qquad m(T) = 1$$

(M standing for meningitis, C for contusion and T for tumor) for the masses from Dempster-Shafer renormalization, where the normalization coefficient is

$$1 - m(\emptyset) = 0.0001$$

From our principles, it is possible to give a belief for the global model. Without renormalization the two doctors give the beliefs

$$[T/e]_1 = 0.01 \qquad [T/e]_2 = 0.01$$

With the principles of this chapter, the numerical value for any fusion arising from these two beliefs is equal to or less than $0.01 = \min([T/e]_1, [T/e]_2)$. So the Dempster-Shafer normalization is not a fusion!
The normalization is in contradiction with the arguments of this chapter. Note that the hybrid DSm rule 
of combination proposed in Chapter[]] provides in this example explained in details in Chapter[B] (Section 
E31) Cr(T) = m(T) = 0.0001 < min([T/e]1, [T/e]2) which is coherent with a confidence measure. 
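The numbers quoted for Zadeh's example can be reproduced with a few lines of code. This is only a sketch under stated assumptions: the text gives the beliefs 0.01 on T for each doctor, and we assume the usual form of the example in which the remaining mass 0.99 is committed to M by the first doctor and to C by the second; the hypotheses are treated as exclusive singletons and the helper name `combine_exclusive` is ours.

```python
def combine_exclusive(m1: dict, m2: dict):
    """Conjunctive combination of two sources whose focal elements are exclusive singletons.

    Returns the unnormalized masses, the total non-conflicting mass 1 - m(empty set),
    and the masses renormalized by that coefficient.
    """
    hypotheses = set(m1) | set(m2)
    unnormalized = {h: m1.get(h, 0.0) * m2.get(h, 0.0) for h in hypotheses}
    agreement = sum(unnormalized.values())
    if agreement == 0.0:
        raise ValueError("total conflict: renormalization is undefined")
    normalized = {h: v / agreement for h, v in unnormalized.items()}
    return unnormalized, agreement, normalized


# Assumed masses (0.99 values are the classical setting, not stated in this section).
doctor_1 = {"M": 0.99, "T": 0.01}
doctor_2 = {"C": 0.99, "T": 0.01}

unnormalized, agreement, normalized = combine_exclusive(doctor_1, doctor_2)
upper_bound = min(doctor_1["T"], doctor_2["T"])

print("1 - m(empty) =", agreement)
print("m(T) without renormalization =", unnormalized["T"])
print("m(T) with renormalization    =", normalized["T"])
print("min of the two beliefs       =", upper_bound)
```

Under these assumptions the unnormalized value stays below the minimum of the two beliefs, whereas the renormalized value exceeds it, which is the contradiction pointed out above.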


The probable explanation is that the Dempster-Shafer normalization is the only mistake of the model. One supposes global cohesion between the initial mass values coming from Dempster-Shafer rules. In mathematics, we know it is often impossible to extend analytic functions to the whole complex plane C; global cohesion is impossible! For example, the logarithmic function is defined locally in a neighbourhood of each nonzero point, but it is not defined on the whole complex plane. The global cohesion is probably the mistake. The DSmT framework seems to provide a better model to satisfy confidence measures and fusion postulates. Some theoretical investigations are currently being carried out to fully analyze DSmT in the context of this work.


Another way to explain losses of mass in Dempster-Shafer theory is to introduce new sets. In any probability diffusion, we occasionally observe probability masses escaping to infinity as the system evolves. Let us take the mass 1 in position {n} and let n increase to infinity: we have no more mass on the real line R. Similarly, let us take the masses 0.5 on {−n} and 0.5 on {n}; this time we load {−∞} and {∞} as n increases to infinity. In the Dempster-Shafer model, one sometimes loads the empty set {∅} and (or) an extra set, only to explain vanishing masses.


Probably Dempster-Shafer renormalization is the only mistake of the model, because a false global property of masses is supposed. It is important to know the axioms that are necessary for renormalization to be valid.

Surroundings are so different that fusion described only by product is certainly a construction that is too restrictive.

The processing, in concrete applications, of the results presented here supposes additional hypotheses, since any information fusion introduces strictly increasing monotone functions whose existence is proven in this paper. These functions (not only one!) remain to be identified for each application. Theoretical considerations should make it possible to retain certain typical families of functions. Experimental results would next identify the unknown parameters of some parameterized family of such functions.

Applications of such a methodology to information fusion, such as air pollution measurements given by sensors, will be processed.

Moreover, during its time evolution, the information data fusion can thus be described by successive t-norms amongst which probability should be introduced.
Chapter 12


On the Tweety Penguin Triangle Problem


Jean Dezert
ONERA
29 Av. de la Division Leclerc
92320 Châtillon
France

Florentin Smarandache
Department of Mathematics
University of New Mexico
Gallup, NM 8730
U.S.A.


Abstract: In this chapter, one studies the well-known and challenging Tweety Penguin Triangle Problem (TPTP or TP2) pointed out by Judea Pearl in one of his books. We first present the solution of the TP2 based on fallacious Bayesian reasoning and prove that this reasoning cannot be used to conclude whether the penguin-bird Tweety is able to fly or not. Then we present in detail the counter-intuitive solution obtained from the Dempster-Shafer Theory (DST). Finally, we show how the solution can be obtained with our new theory of plausible and paradoxical reasoning (DSmT).
12.1 Introduction 


Judea Pearl claimed that the DST of evidence fails to provide a reasonable solution for the combination of evidence even for apparently very simple fusion problems [11] [12]. Most criticisms are answered by Philippe Smets in [22, 23]. The Tweety Penguin Triangle Problem (TP2) is one of the typical exciting and challenging problems for all theories managing uncertainty and conflict, because it shows the real difficulty of maintaining truth in automatic reasoning systems when the classical property of transitivity (which is basic to material implication) does not hold. In his book, Judea Pearl presents and discusses in detail the semantic clash between Bayesian and Dempster-Shafer reasoning. We present here our analysis of this problem and provide a new solution of the Tweety Penguin Triangle Problem based on our new theory of plausible and paradoxical reasoning, known as DSmT (Dezert-Smarandache Theory). We show how this problem can be attacked and solved by our new reasoning with the help of the (hybrid) DSm rule
of combination (see chapter Æ). The purpose of this chapter is not to browse all approaches available in 
literature for attacking the TP2 problem but only to provide a comparison of the DSm reasoning with respect to the Bayesian reasoning and to the plausible reasoning of the DST framework. Interesting but complex analyses of this problem based on default reasoning and ε-belief functions can also be found, for example, in [22] and [1]. Other interesting and promising approaches to the TP2 problem based on the fuzzy logic of Zadeh jointly with the theory of possibilities are under investigation. Some theoretical research works on new conditional event algebras (CEA) have emerged in the literature [7] in recent years and could offer a new track for attacking the TP2 problem, although unfortunately no clear, didactic, simple and convincing examples are provided to show the real efficiency and usefulness of these theoretical investigations.


12.2 The Tweety Penguin Triangle Problem 


This very important and challenging problem, known in the literature as the Tweety Penguin Triangle Problem (TP2), is presented in detail by Judea Pearl in [11]. We briefly present here the TP2 and the solutions based first on fallacious Bayesian reasoning and then on Dempster-Shafer reasoning. We will then focus our analysis of this problem on the DSmT framework and the DSm reasoning.

Let's consider the set $R = \{r_1, r_2, r_3\}$ of given rules (also known as defaults in [1]):
• $r_1$: "Penguins normally don't fly" ⇔ $(p \rightarrow \neg f)$
• $r_2$: "Birds normally fly" ⇔ $(b \rightarrow f)$
• $r_3$: "Penguins are birds" ⇔ $(p \rightarrow b)$

To emphasize our strong conviction in these rules, we commit to them some high confidence weights $w_1$, $w_2$ and $w_3$ in [0,1] with $w_1 = 1 - \epsilon_1$, $w_2 = 1 - \epsilon_2$ and $w_3 = 1$ (where $\epsilon_1$ and $\epsilon_2$ are small positive quantities). The conviction in these rules is then represented by the set $W = \{w_1, w_2, w_3\}$ in the sequel.


Another useful and general notation adopted by Judea Pearl in the first pages of his book to characterize these three weighted rules is the following one (where $w_1, w_2, w_3 \in [0, 1]$):

$$r_1: p \xrightarrow{w_1} (\neg f) \qquad r_2: b \xrightarrow{w_2} f \qquad r_3: p \xrightarrow{w_3} b$$
When $w_1, w_2, w_3 \in \{0, 1\}$, classical logic is the perfect tool to conclude on the truth or on the falsity of a proposition built from these rules, based on the standard propositional calculus, mainly with its three fundamental rules (Modus Ponens, Modus Tollens and Modus Barbara, i.e. the transitivity rule). When $0 < w_1, w_2, w_3 < 1$, classical logic cannot be applied because the Modus Ponens, the Modus Tollens and the Modus Barbara no longer hold, and some other tools must be chosen. This will be discussed in detail in section 3.2.


Question: Assume we observe an animal called Tweety (T) that is categorically classified as a bird (b) and a penguin (p), i.e. our observation is $O \triangleq [T = (b \cap p)] = [(T = b) \cap (T = p)]$. The notation $T = (b \cap p)$ stands here for "Entity T holds property $(b \cap p)$". What is the belief (or the probability, if such a probability exists) that Tweety can fly given the observation O and all the information available in our knowledge base (i.e. our rule-based system R and W)?


The difficulty of this problem for most artificial reasoning systems (ARS) comes from the fact that, in this example, the property of transitivity, usually supposed satisfied from the material-implication interpretation [11], $(p \rightarrow b,\ b \rightarrow f) \Rightarrow (p \rightarrow f)$, does not hold here (see section 12.3.2). In this interesting example, the classical property of inheritance is thus broken. Nevertheless, a powerful artificial reasoning system must be able to deal with this kind of difficult problem and must provide a reliable conclusion by a general mechanism of reasoning whatever the values of the convictions are (not only restricted to values close to either 0 or 1). We now examine three ARS: one based on Bayesian reasoning [11], which turns out to be fallacious and actually not appropriate for this problem (we explain why), one based on the Dempster-Shafer Theory (DST) [16], and one based on the Dezert-Smarandache Theory (DSmT) (see part I of this book).


12.3 The fallacious Bayesian reasoning 


We first present the fallacious Bayesian reasoning solution drawn from J. Pearl's book [11] (pages 447-449), and then we explain why the solution, which at first glance seems to agree with intuition, is really fallacious. We then explain why the common rational intuition actually turns out to be wrong and show the weakness of Pearl's analysis.


12.3.1 The Pearl’s analysis 


To preserve mathematical rigor, we explicitly introduce all the available information in the derivations. In other words, one wants to evaluate, using Bayesian reasoning, the conditional probability (if it exists) $P(T = f \mid O, R, W) = P(T = f \mid T = p, T = b, R, W)$. Pearl's analysis is based on the assumption that a conviction in a given rule can be interpreted as a conditional probability (see [11] page 4). In other words, if one has a given rule $a \xrightarrow{w} b$ with $w \in [0, 1]$, then one can interpret, at least for the calculus, $w$ as $P(b \mid a)$, and thus probability theory and Bayesian reasoning can help to answer the question. We prove in the following section that such a model cannot reasonably be adopted. For now, we just assume that such a probabilistic model effectively holds, as Judea Pearl does. Based on this assumption, since the conditioning term/information $(T = p, T = b, R, W)$ is strictly equivalent to $(T = p, R, W)$ because rule $r_3$ is known with certainty (since $w_3 = 1$), one easily gets the fallacious, intuitively expected, Pearl's result:

$$\begin{aligned}
P(T = f \mid O, R, W) &= P(T = f \mid T = p, T = b, R, W) \\
&= P(T = f \mid T = p, R, W) \\
&= 1 - P(T = \bar{f} \mid T = p, R, W) \\
&= 1 - w_1 = \epsilon_1
\end{aligned}$$

From this simple analysis, Tweety's "birdness" does not render her a better flyer than an ordinary penguin, as intuitively expected, and the probability that Tweety can fly remains very low, which looks normal. We reemphasize here the fact that, in his Bayesian reasoning, J. Pearl assumes that the weight $w_1$ for the conviction in rule $r_1$ can be interpreted in terms of a real probability measure $P(\bar{f} \mid p)$. This assumption is necessary to provide the rigorous derivation of $P(T = f \mid O, R, W)$. It turns out, however, that convictions $w_i$ on logical rules cannot be interpreted in terms of probabilities, as we will prove in the next section.


When rule $r_3$ is not asserted with absolute certainty (i.e. $w_3 = 1$) but is subject to exceptions, i.e. $w_3 = 1 - \epsilon_3 < 1$, the fallacious Bayesian reasoning yields (where the notations $T = f$, $T = b$ and $T = p$ are replaced by $f$, $b$ and $p$ due to space limitations):

$$\begin{aligned}
P(f \mid O, R, W) &= P(f \mid p, b, R, W) \\
&= \frac{P(f, p, b \mid R, W)}{P(p, b \mid R, W)} \\
&= \frac{P(f, b \mid p, R, W)\, P(p \mid R, W)}{P(b \mid p, R, W)\, P(p \mid R, W)}
\end{aligned}$$

By assuming $P(p \mid R, W) > 0$, one gets after simplification by $P(p \mid R, W)$

$$P(f \mid O, R, W) = \frac{P(f, b \mid p, R, W)}{P(b \mid p, R, W)} = \frac{P(b \mid f, p, R, W)\, P(f \mid p, R, W)}{P(b \mid p, R, W)}$$

If one assumes $P(b \mid p, R, W) = w_3 = 1 - \epsilon_3$ and $P(f \mid p, R, W) = 1 - P(\bar{f} \mid p, R, W) = 1 - w_1 = \epsilon_1$, one gets

$$P(f \mid O, R, W) = P(b \mid f, p, R, W) \times \frac{\epsilon_1}{1 - \epsilon_3}$$

Because $0 \leq P(b \mid f, p, R, W) \leq 1$, one finally gets Pearl's result [11] (p.448)

$$P(f \mid O, R, W) \leq \frac{\epsilon_1}{1 - \epsilon_3} \tag{12.1}$$

which states that the observed animal Tweety (a penguin-bird) has a very small probability of flying as long as $\epsilon_3$ remains small, regardless of how many birds cannot fly ($\epsilon_2$), and has consequently a high probability of not flying because $P(f \mid O, R, W) + P(\bar{f} \mid O, R, W) = 1$ since the events $f$ and $\bar{f}$ are mutually exclusive and exhaustive (assuming that Pearl's probabilistic model holds ... ).
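The inequality (12.1) is a pure consequence of the conditioning rule, whatever the joint distribution is, as long as a probabilistic model is assumed to exist. The following sketch checks the underlying bound $P(f \mid p, b) \leq P(f \mid p) / P(b \mid p)$ on randomly drawn joint distributions over the three binary properties; the helper names and the random construction are ours and purely illustrative.

```python
import itertools
import random


def random_joint(seed: int) -> dict:
    """Random strictly positive joint distribution over the truth values of (p, b, f)."""
    rng = random.Random(seed)
    atoms = list(itertools.product([False, True], repeat=3))
    weights = [rng.random() + 1e-3 for _ in atoms]
    total = sum(weights)
    return {atom: weight / total for atom, weight in zip(atoms, weights)}


def prob(joint: dict, event) -> float:
    """Probability of the event described by a predicate on (p, b, f)."""
    return sum(pr for atom, pr in joint.items() if event(*atom))


def pearl_bound_terms(joint: dict):
    """Return (P(f | p, b), P(f | p) / P(b | p)) for the given joint distribution."""
    p_p = prob(joint, lambda p, b, f: p)
    p_pb = prob(joint, lambda p, b, f: p and b)
    p_f_given_pb = prob(joint, lambda p, b, f: p and b and f) / p_pb
    p_f_given_p = prob(joint, lambda p, b, f: p and f) / p_p
    p_b_given_p = p_pb / p_p
    return p_f_given_pb, p_f_given_p / p_b_given_p


lhs, rhs = pearl_bound_terms(random_joint(0))
print(lhs <= rhs)
```

The check says nothing about whether the weights of the rules may legitimately be read as such conditional probabilities, which is the question examined next.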


12.3.2 The weakness of the Pearl's analysis 


We now prove that the previous Bayesian reasoning is really fallacious and that, if a deep analysis is done, the problem of concluding whether Tweety is able to fly or not is truly undecidable. Actually,
the Bayes’ inference is not a classical inference (see chapter B] for justification). Indeed, before applying 
blindly the Bayesian reasoning as in the previous section, one first has to check that the probabilistic model is well-founded to characterize the convictions of the rules of the rule-based system under analysis. We prove here that such a probabilistic model does not hold for a suitable and useful representation of the problem, and consequently for any problem based on the weighting of logical rules (with positive weighting factors/convictions below 1).


12.3.2.1 Preliminaries 


We recall here only a few important principles of the propositional calculus of classical Mathematical Logic which will be used in our demonstration. A simple notation, which may appear unusual to logicians, is adopted here just for convenience. A detailed presentation of the propositional calculus
and Mathematical Logic can be easily found in many standard mathematical textbooks like [15] [0] [9]. 


Here are these important principles:

• Excluded middle principle: a logical variable is either true or false, i.e.

$$a \vee \neg a \tag{12.2}$$

• Non-contradiction law: a logical variable cannot be both true and false, i.e.

$$\neg(a \wedge \neg a) \tag{12.3}$$

• Modus Ponens: this rule of the propositional calculus states that if a logical variable $a$ is true and $a \rightarrow b$ is true, then $b$ is true (syllogism principle), i.e.

$$(a \wedge (a \rightarrow b)) \rightarrow b \tag{12.4}$$

• Modus Tollens: this rule of the propositional calculus states that if a logical variable $\neg b$ is true and $a \rightarrow b$ is true, then $\neg a$ is true, i.e.

$$(\neg b \wedge (a \rightarrow b)) \rightarrow \neg a \tag{12.5}$$

• Modus Barbara: this rule of the propositional calculus states that if $a \rightarrow b$ is true and $b \rightarrow c$ is true, then $a \rightarrow c$ is true (transitivity property), i.e.

$$((a \rightarrow b) \wedge (b \rightarrow c)) \rightarrow (a \rightarrow c) \tag{12.6}$$

From these principles, one can easily prove, based on the truth table method, the following property (more general deducibility theorems in Mathematical Logic can be found in [18] [19]):

$$((a \rightarrow b) \wedge (c \rightarrow d)) \rightarrow ((a \wedge c) \rightarrow (b \wedge d)) \tag{12.7}$$
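The truth table method mentioned above is easy to mechanize. The sketch below, with helper names of our own choosing, enumerates all truth assignments and checks that the excluded middle, the non-contradiction law, the Modus Ponens, the Modus Tollens, the Modus Barbara and the property ((a → b) ∧ (c → d)) → ((a ∧ c) → (b ∧ d)) are tautologies of classical propositional calculus.

```python
from itertools import product


def implies(a: bool, b: bool) -> bool:
    """Material implication a -> b."""
    return (not a) or b


def is_tautology(formula, n_vars: int) -> bool:
    """True if the formula holds for every assignment of its n_vars boolean variables."""
    return all(formula(*values) for values in product([False, True], repeat=n_vars))


principles = {
    "excluded middle (12.2)": (lambda a: a or (not a), 1),
    "non-contradiction (12.3)": (lambda a: not (a and (not a)), 1),
    "modus ponens (12.4)": (lambda a, b: implies(a and implies(a, b), b), 2),
    "modus tollens (12.5)": (lambda a, b: implies((not b) and implies(a, b), not a), 2),
    "modus barbara (12.6)": (
        lambda a, b, c: implies(implies(a, b) and implies(b, c), implies(a, c)), 3),
    "property (12.7)": (
        lambda a, b, c, d: implies(implies(a, b) and implies(c, d),
                                   implies(a and c, b and d)), 4),
}

for name, (formula, n_vars) in principles.items():
    print(name, is_tautology(formula, n_vars))
```

By contrast, a formula such as (a → b) → (b → a) fails the same test, which shows that the check is not vacuous.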


12.3.2.2 Analysis of the problem when $\epsilon_1 = \epsilon_2 = \epsilon_3 = 0$


We first examine the TP2 when one has no doubt in the rules of our given rule-based system, i.e.

$$r_1: p \xrightarrow{w_1 = 1 - \epsilon_1 = 1} (\neg f) \qquad r_2: b \xrightarrow{w_2 = 1 - \epsilon_2 = 1} f \qquad r_3: p \xrightarrow{w_3 = 1 - \epsilon_3 = 1} b$$

From rules $r_1$ and $r_2$ and because of property (12.7), one concludes that

$$p \wedge b \rightarrow (f \wedge \neg f)$$

and using the non-contradiction law (12.3) with the Modus Tollens (12.5), one finally gets

$$\neg(f \wedge \neg f) \rightarrow \neg(p \wedge b)$$

which proves that $p \wedge b$ is always false whatever the rule $r_3$ is. Interpreted in terms of probability theory, the event $T = p \cap b$ corresponds actually and truly to the impossible event $\emptyset$, since $T = f$ and $T = \bar{f}$ are exclusive and exhaustive events. Under such conditions, the analysis proves the non-existence of the penguin-bird Tweety.
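This conclusion can be confirmed by brute force. The sketch below, with hypothetical helper names, reads the three rules as strict material implications (all weights equal to 1) and enumerates the truth assignments of (p, b, f) that satisfy them: no assignment compatible with the first two rules makes p and b true together, and once the third rule is added no assignment makes p true at all.

```python
from itertools import product


def implies(a: bool, b: bool) -> bool:
    """Material implication a -> b."""
    return (not a) or b


def models(rules) -> list:
    """All truth assignments (p, b, f) satisfying every rule in the list."""
    return [(p, b, f)
            for p, b, f in product([False, True], repeat=3)
            if all(rule(p, b, f) for rule in rules)]


def r1(p, b, f):  # penguins don't fly: p -> not f
    return implies(p, not f)


def r2(p, b, f):  # birds fly: b -> f
    return implies(b, f)


def r3(p, b, f):  # penguins are birds: p -> b
    return implies(p, b)


print(models([r1, r2]))      # no model has p and b both true
print(models([r1, r2, r3]))  # no model has p true
```

With strict rules, a penguin-bird therefore has no model, which is the logical counterpart of the impossible event discussed above.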


If one adopts the notations¹ of probability theory, trying to derive $P(T = f \mid T = p \cap b)$ and $P(T = \bar{f} \mid T = p \cap b)$ with Bayesian reasoning is just impossible because, from one of the axioms of probability theory, one must have $P(\emptyset) = 0$ and, from the conditioning rule, one would get for this problem the indeterminate expressions:

$$P(T = f \mid T = p \cap b) = P(T = f \mid T = \emptyset) = \frac{P(T = f \cap \emptyset)}{P(T = \emptyset)} = \frac{P(T = \emptyset)}{P(T = \emptyset)} = \frac{0}{0} \quad \text{(indeterminate)}$$

and similarly

$$P(T = \bar{f} \mid T = p \cap b) = P(T = \bar{f} \mid T = \emptyset) = \frac{P(T = \bar{f} \cap \emptyset)}{P(T = \emptyset)} = \frac{P(T = \emptyset)}{P(T = \emptyset)} = \frac{0}{0} \quad \text{(indeterminate)}$$

¹ Because probabilities are related to sets, we use here the common set-complement notation $\bar{f}$ instead of the logical negation notation $\neg f$, $\cap$ for $\wedge$ and $\cup$ for $\vee$ if necessary.


12.3.2.3 Analysis of the problem when $0 < \epsilon_1, \epsilon_2, \epsilon_3 < 1$


Let's examine now the general case when one allows some little doubt on the rules, characterized by taking $\epsilon_1 \gtrsim 0$, $\epsilon_2 \gtrsim 0$ and $\epsilon_3 \gtrsim 0$, and examine the consequences on the probabilistic model of these rules.

First note that, because of the excluded middle principle and the assumption of the existence of a probabilistic model for a weighted rule, one should be able to consider simultaneously both "probabilistic/Bayesian" rules

$$a \xrightarrow{P(b \mid a) = w} b \qquad \text{and} \qquad a \xrightarrow{P(\bar{b} \mid a) = 1 - w} \bar{b} \tag{12.8}$$

In terms of classical (objective) probability theory, these weighted rules just indicate that in $100 \times w$ percent of cases the logical variable $b$ is true if $a$ is true, or equivalently, that in $100 \times w$ percent of cases the random event $b$ occurs when the random event $a$ occurs. When we don't refer to classical probability theory, the weighting factors $w$ and $1 - w$ just indicate the level of conviction committed to the validity of the rules. Although very appealing at first glance, this probabilistic model actually hides a strong drawback/weakness, especially when dealing with several rules, as shown right below.

Let's prove first that from a "probabilized" rule $a \xrightarrow{P(b \mid a) = w} b$ one cannot assess rigorously the convictions onto its Modus Tollens. In other words, from (12.8) what can we conclude on

$$\bar{b} \xrightarrow{P(\bar{a} \mid \bar{b}) = ?} \bar{a} \qquad \text{and} \qquad b \xrightarrow{P(\bar{a} \mid b) = ?} \bar{a} \tag{12.9}$$
From Bayes' rule of conditioning (which must hold if the probabilistic model holds), one can express $P(\bar{a} \mid \bar{b})$ and $P(\bar{a} \mid b)$ as follows

$$P(\bar{a} \mid \bar{b}) = 1 - P(a \mid \bar{b}) = 1 - \frac{P(a \cap \bar{b})}{P(\bar{b})} = 1 - \frac{P(\bar{b} \mid a)\, P(a)}{P(\bar{b})}$$

$$P(\bar{a} \mid b) = 1 - P(a \mid b) = 1 - \frac{P(a \cap b)}{P(b)} = 1 - \frac{P(b \mid a)\, P(a)}{P(b)}$$

or equivalently, by replacing $P(b \mid a)$ and $P(\bar{b} \mid a)$ by their values $w$ and $1 - w$, one gets

$$P(\bar{a} \mid \bar{b}) = 1 - (1 - w) \frac{P(a)}{P(\bar{b})} \qquad P(\bar{a} \mid b) = 1 - w \frac{P(a)}{P(b)} \tag{12.10}$$

These relationships show that one cannot fully derive in theory $P(\bar{a} \mid \bar{b})$ and $P(\bar{a} \mid b)$ because the prior probabilities $P(a)$ and $P(b)$ are unknown.

A simplistic solution, based on the principle of indifference, is then just to assume without solid justification that $P(a) = P(\bar{a}) = 1/2$ and $P(b) = P(\bar{b}) = 1/2$. With such an assumption, one gets the estimates $\hat{P}(\bar{a} \mid \bar{b}) = w$ and $\hat{P}(\bar{a} \mid b) = 1 - w$ for $P(\bar{a} \mid \bar{b})$ and $P(\bar{a} \mid b)$ respectively, and we can go further in the derivations.
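The dependence of the Modus Tollens convictions on the unknown priors can be made concrete. The sketch below assumes the relationships $P(\bar{a} \mid \bar{b}) = 1 - (1 - w) P(a) / P(\bar{b})$ and $P(\bar{a} \mid b) = 1 - w P(a) / P(b)$ obtained from Bayes' rule with $P(b \mid a) = w$; the function name and the coherence check on the priors are our own additions.

```python
def modus_tollens_convictions(w: float, p_a: float, p_b: float):
    """Return (P(not a | not b), P(not a | b)) given P(b | a) = w and the priors P(a), P(b)."""
    if not (0.0 < p_a < 1.0 and 0.0 < p_b < 1.0):
        raise ValueError("priors must lie strictly between 0 and 1")
    # Coherence of the priors with the rule:
    # P(a and b) = w P(a) <= P(b) and P(a and not b) = (1 - w) P(a) <= P(not b).
    if w * p_a > p_b or (1.0 - w) * p_a > 1.0 - p_b:
        raise ValueError("priors are incompatible with P(b | a) = w")
    p_not_a_given_not_b = 1.0 - (1.0 - w) * p_a / (1.0 - p_b)
    p_not_a_given_b = 1.0 - w * p_a / p_b
    return p_not_a_given_not_b, p_not_a_given_b


# Principle of indifference: the estimates reduce to (w, 1 - w).
print(modus_tollens_convictions(0.9, 0.5, 0.5))

# Another admissible choice of priors gives different convictions for the same rule.
print(modus_tollens_convictions(0.9, 0.2, 0.5))
```

The same weight $w$ thus leads to different convictions on the contrapositive rule as soon as the priors change, which is why the principle of indifference is only a convenient convention here.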


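Now let's go back to our Tweety Penguin Triangle Problem. Based on the probabilistic model (assumed to hold), one starts now with both

$$\begin{aligned}
r_1 &: p \xrightarrow{P(\bar{f} \mid p) = 1 - \epsilon_1} \bar{f} & p &\xrightarrow{P(f \mid p) = \epsilon_1} f \\
r_2 &: b \xrightarrow{P(f \mid b) = 1 - \epsilon_2} f & b &\xrightarrow{P(\bar{f} \mid b) = \epsilon_2} \bar{f} \\
r_3 &: p \xrightarrow{P(b \mid p) = 1 - \epsilon_3} b & p &\xrightarrow{P(\bar{b} \mid p) = \epsilon_3} \bar{b}
\end{aligned} \tag{12.11}$$

Note that, taking into account our preliminary analysis and accepting the principle of indifference, one also has the two sets of weighted rules

$$\begin{aligned}
f &\xrightarrow{P(\bar{p} \mid f) = 1 - \epsilon_1} \bar{p} & \bar{f} &\xrightarrow{P(\bar{p} \mid \bar{f}) = \epsilon_1} \bar{p} \\
\bar{f} &\xrightarrow{P(\bar{b} \mid \bar{f}) = 1 - \epsilon_2} \bar{b} & f &\xrightarrow{P(\bar{b} \mid f) = \epsilon_2} \bar{b} \\
\bar{b} &\xrightarrow{P(\bar{p} \mid \bar{b}) = 1 - \epsilon_3} \bar{p} & b &\xrightarrow{P(\bar{p} \mid b) = \epsilon_3} \bar{p}
\end{aligned} \tag{12.12}$$

One wants to assess the convictions (assumed to correspond to some conditional probabilities) into the following rules

$$p \wedge b \xrightarrow{P(f \mid p \cap b) = ?} f \tag{12.13}$$

$$p \wedge b \xrightarrow{P(\bar{f} \mid p \cap b) = ?} \bar{f} \tag{12.14}$$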
The question is to derive rigorously $P(f \mid p \cap b)$ and $P(\bar{f} \mid p \cap b)$ from all the previously available information. It turns out that the derivation is impossible without an unjustified extra assumption of conditional independence. Indeed, $P(f \mid p \cap b)$ and $P(\bar{f} \mid p \cap b)$ are given by

$$\begin{aligned}
P(f \mid p \cap b) &= \frac{P(f, p, b)}{P(p, b)} = \frac{P(p, b \mid f)\, P(f)}{P(b \mid p)\, P(p)} \\
P(\bar{f} \mid p \cap b) &= \frac{P(\bar{f}, p, b)}{P(p, b)} = \frac{P(p, b \mid \bar{f})\, P(\bar{f})}{P(b \mid p)\, P(p)}
\end{aligned} \tag{12.15}$$

If one assumes, as J. Pearl does, that the conditional independence condition also holds, i.e. $P(p, b \mid f) = P(p \mid f)\, P(b \mid f)$ and $P(p, b \mid \bar{f}) = P(p \mid \bar{f})\, P(b \mid \bar{f})$, then one gets

$$P(f \mid p \cap b) = \frac{P(p \mid f)\, P(b \mid f)\, P(f)}{P(b \mid p)\, P(p)} \qquad P(\bar{f} \mid p \cap b) = \frac{P(p \mid \bar{f})\, P(b \mid \bar{f})\, P(\bar{f})}{P(b \mid p)\, P(p)}$$

By accepting again the principle of indifference, $P(f) = P(\bar{f}) = 1/2$ and $P(p) = P(\bar{p}) = 1/2$, one gets the following expressions

$$\hat{P}(f \mid p \cap b) = \frac{P(p \mid f)\, P(b \mid f)}{P(b \mid p)} \qquad \hat{P}(\bar{f} \mid p \cap b) = \frac{P(p \mid \bar{f})\, P(b \mid \bar{f})}{P(b \mid p)} \tag{12.16}$$

Replacing the probabilities $P(p \mid f)$, $P(b \mid f)$, $P(b \mid p)$, $P(p \mid \bar{f})$ and $P(b \mid \bar{f})$ by their values in the formula (12.16), one finally gets

$$\hat{P}(f \mid p \cap b) = \frac{\epsilon_1 (1 - \epsilon_2)}{1 - \epsilon_3} \qquad \hat{P}(\bar{f} \mid p \cap b) = \frac{\epsilon_2 (1 - \epsilon_1)}{1 - \epsilon_3} \tag{12.17}$$

Therefore we see that, even if one accepts the principle of indifference together with the conditional independence assumption, the approximated "probabilities" both remain small and do not correspond to a real measure of probability, since the conditional probabilities of the exclusive elements $f$ and $\bar{f}$ do not add up to one. When $\epsilon_1$, $\epsilon_2$ and $\epsilon_3$ tend towards 0, one has

$$\hat{P}(f \mid p \cap b) + \hat{P}(\bar{f} \mid p \cap b) \approx 0$$

Actually our analysis, based on the principle of indifference, the conditional independence assumption and the model proposed by Judea Pearl, proves clearly the impossibility of applying Bayesian reasoning rigorously to this kind of weighted rule-based system, because no probabilistic model exists for describing the problem correctly. This conclusion is actually not surprising taking into account Lewis' theorem [13], explained in detail in [7] (chapter 11).
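A short numerical illustration makes the defect visible. The sketch assumes the approximated expressions $\epsilon_1 (1 - \epsilon_2) / (1 - \epsilon_3)$ for flying and $\epsilon_2 (1 - \epsilon_1) / (1 - \epsilon_3)$ for not flying, i.e. the values obtained under the principle of indifference and the conditional independence assumption; the function name is ours.

```python
def approximated_convictions(eps1: float, eps2: float, eps3: float):
    """Approximated 'probabilities' of f and of not-f given p and b.

    Assumes the principle of indifference and conditional independence,
    with rule weights w_i = 1 - eps_i.
    """
    if not all(0.0 <= eps < 1.0 for eps in (eps1, eps2, eps3)):
        raise ValueError("each eps must satisfy 0 <= eps < 1")
    p_fly = eps1 * (1.0 - eps2) / (1.0 - eps3)
    p_not_fly = eps2 * (1.0 - eps1) / (1.0 - eps3)
    return p_fly, p_not_fly


for eps in (0.1, 0.01, 0.001):
    p_fly, p_not_fly = approximated_convictions(eps, eps, eps)
    print(eps, p_fly, p_not_fly, p_fly + p_not_fly)
```

For small doubts the two values are both close to zero, so their sum is far from one and they cannot be the conditional probabilities of two exclusive and exhaustive events.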
Let's now explain the reason for the error in the fallacious reasoning, which seemed coherent with common intuition. The problem arises directly from the fact that the penguin class and the bird class are defined in this problem only with respect to the "flying" and "not-flying" properties. If one considers only these properties, then no animal Tweety can be categorically classified as a penguin-bird, because penguin-birdness does not hold in reality based on these exclusive and exhaustive properties (if we consider only the information given within the rules $r_1$, $r_2$ and $r_3$). Actually, everybody knows that penguins are effectively classified as birds because the "birdness" property is not defined with respect to the "flying" or "not-flying" abilities of the animal but by other zoological characteristics C (birds are warm-blooded, oviparous vertebrates with a beak, feathers, and forelimbs that are wings), and such information must be properly taken into account in the rule-based system to avoid falling into the trap of such fallacious reasoning. The intuition (which seems to justify the conclusion of the fallacious reasoning) for the TP2 is actually biased because one already knows that penguins (which are truly classified as birds by some other criteria) do not fly in the real world, and thus we commit a low conviction (which is definitely not a probability measure, but rather a belief) to the fact that a penguin-bird can fly. Thus Pearl's analysis proposed in [11] appears to the authors to be unfortunately incomplete and somehow fallacious.


12.4 The Dempster-Shafer reasoning 


As pointed out by Judea Pearl in [11], the Dempster-Shafer reasoning yields, for this problem, a very counter-intuitive result: birdness seems to endow Tweety with extra flying power! We present here our analysis of this problem based on the Dempster-Shafer reasoning.


Let's examine in detail the available prior information summarized by the rule $r_1$: "Penguins normally don't fly" $\Leftrightarrow (p \rightarrow \bar{f})$ with the conviction $w_1 = 1 - \epsilon_1$, where $\epsilon_1$ is a small positive number close to zero. This information, in the DST framework, has to be correctly represented in terms of a conditional belief $Bel_1(\bar{f}|p) = 1 - \epsilon_1$ rather than directly by the mass $m_1(\bar{f} \cap p) = 1 - \epsilon_1$.

Choosing $Bel_1(\bar{f}|p) = 1 - \epsilon_1$ means that there is a high degree of belief that a penguin-animal is also a nonflying-animal (whatever kind of animal we are observing). This representation reflects perfectly our prior knowledge, while the erroneous coarse modeling based on the commitment $m_1(\bar{f} \cap p) = 1 - \epsilon_1$ is unable to distinguish between rule $r_1$ and another (possibly erroneous) rule like $r_1' : (\bar{f} \rightarrow p)$ having the same conviction value $w_1$. This correct model allows us to distinguish between $r_1$ and $r_1'$ (even if they have the same numerical level of conviction) by considering the two different conditional beliefs $Bel_1(\bar{f}|p) = 1 - \epsilon_1$ and $Bel_{1'}(p|\bar{f}) = 1 - \epsilon_1$. The coarse/inadequate basic belief assignment modeling (if adopted) would, on the contrary, make no distinction between the two rules $r_1$ and $r_1'$, since one would have to take $m_1(\bar{f} \cap p) = m_{1'}(p \cap \bar{f})$, and therefore it cannot serve as the starting model for the analysis.


Similarly, the prior information relative to rules $r_2 : (b \rightarrow f)$ and $r_3 : (p \rightarrow b)$ with convictions $w_2 = 1 - \epsilon_2$ and $w_3 = 1 - \epsilon_3$ has to be modeled by the conditional beliefs $Bel_2(f|b) = 1 - \epsilon_2$ and $Bel_3(b|p) = 1 - \epsilon_3$ respectively.


The first problem we have to face now is the combination of these three pieces of prior information characterized by $Bel_1(\bar{f}|p) = 1 - \epsilon_1$, $Bel_2(f|b) = 1 - \epsilon_2$ and $Bel_3(b|p) = 1 - \epsilon_3$. All the available prior information can actually be viewed as three independent bodies of evidence $\mathcal{B}_1$, $\mathcal{B}_2$ and $\mathcal{B}_3$ providing separately the partial knowledge summarized through the values of $Bel_1(\bar{f}|p)$, $Bel_2(f|b)$ and $Bel_3(b|p)$. To achieve the combination, one needs to define complete basic belief assignments $m_1(.)$, $m_2(.)$ and $m_3(.)$ compatible with the partial conditional beliefs $Bel_1(\bar{f}|p) = 1 - \epsilon_1$, $Bel_2(f|b) = 1 - \epsilon_2$ and $Bel_3(b|p) = 1 - \epsilon_3$ without introducing extra knowledge. We don't want to introduce into the derivations any extra information that we don't have in reality. We present in detail the justification for the choice of assignment $m_1(.)$. The choice for $m_2(.)$ and $m_3(.)$ will follow similarly.


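The body of evidence $\mathcal{B}_1$ provides some information only about $\bar{f}$ and $p$ through the value of $Bel_1(\bar{f}|p)$ and without reference to $b$. Therefore the frame of discernment $\Theta_1$ induced by $\mathcal{B}_1$ and satisfying Shafer's model (i.e. a finite set of exhaustive and exclusive elements) corresponds to

$$\Theta_1 = \{\theta_1 \triangleq \bar{f} \cap \bar{p},\ \theta_2 \triangleq f \cap \bar{p},\ \theta_3 \triangleq \bar{f} \cap p,\ \theta_4 \triangleq f \cap p\}$$

schematically represented by the partition

$$\bar{p} = \theta_1 \cup \theta_2, \quad p = \theta_3 \cup \theta_4, \quad f = \theta_2 \cup \theta_4, \quad \bar{f} = \theta_1 \cup \theta_3$$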

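The complete basic assignment $m_1(.)$ we are searching for, defined over the power set $2^{\Theta_1}$ and required to be compatible with $Bel_1(\bar{f}|p)$, is actually the result of Dempster's combination of an unknown (for now) basic belief assignment $m_1'(.)$ with the particular assignment $m_1''(.)$ defined by $m_1''(p \triangleq \theta_3 \cup \theta_4) = 1$; in other words, one has

$$m_1(.) = [m_1' \oplus m_1''](.)$$

From now on, we introduce explicitly the conditioning term in our notation to avoid confusion, and thus we use $m_1(.|p) = m_1(.|\theta_3 \cup \theta_4)$ instead of $m_1(.)$. From $m_1''(p \triangleq \theta_3 \cup \theta_4) = 1$ and from any generic unknown basic assignment $m_1'(.)$ defined by its components $m_1'(\emptyset) \triangleq 0$, $m_1'(\theta_1)$, $m_1'(\theta_2)$, $m_1'(\theta_3)$, $m_1'(\theta_4)$, $m_1'(\theta_1 \cup \theta_2)$, $m_1'(\theta_1 \cup \theta_3)$, $m_1'(\theta_1 \cup \theta_4)$, $m_1'(\theta_2 \cup \theta_3)$, $m_1'(\theta_2 \cup \theta_4)$, $m_1'(\theta_3 \cup \theta_4)$, $m_1'(\theta_1 \cup \theta_2 \cup \theta_3)$, $m_1'(\theta_1 \cup \theta_2 \cup \theta_4)$, $m_1'(\theta_1 \cup \theta_3 \cup \theta_4)$, $m_1'(\theta_2 \cup \theta_3 \cup \theta_4)$, $m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4)$, and applying Dempster's rule, one easily gets the following expressions for $m_1(.|\theta_3 \cup \theta_4)$. All $m_1(.|\theta_3 \cup \theta_4)$ masses are zero except theoretically

$$m_1(\theta_3|\theta_3 \cup \theta_4) = m_1''(\theta_3 \cup \theta_4)\,[m_1'(\theta_3) + m_1'(\theta_1 \cup \theta_3) + m_1'(\theta_2 \cup \theta_3) + m_1'(\theta_1 \cup \theta_2 \cup \theta_3)]/K_1$$

$$m_1(\theta_4|\theta_3 \cup \theta_4) = m_1''(\theta_3 \cup \theta_4)\,[m_1'(\theta_4) + m_1'(\theta_1 \cup \theta_4) + m_1'(\theta_2 \cup \theta_4) + m_1'(\theta_1 \cup \theta_2 \cup \theta_4)]/K_1$$

$$m_1(\theta_3 \cup \theta_4|\theta_3 \cup \theta_4) = m_1''(\theta_3 \cup \theta_4)\,[m_1'(\theta_3 \cup \theta_4) + m_1'(\theta_1 \cup \theta_3 \cup \theta_4) + m_1'(\theta_2 \cup \theta_3 \cup \theta_4) + m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4)]/K_1$$

with

$$K_1 \triangleq 1 - m_1''(\theta_3 \cup \theta_4)\,[m_1'(\theta_1) + m_1'(\theta_2) + m_1'(\theta_1 \cup \theta_2)]$$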
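To complete the derivation of $m_1(.|\theta_3 \cup \theta_4)$, one needs to use the fact that one knows that $Bel_1(\bar{f}|p) = 1 - \epsilon_1$ which, by definition [16], is expressed by

$$Bel_1(\bar{f}|p) = Bel_1(\theta_1 \cup \theta_3|\theta_3 \cup \theta_4) = m_1(\theta_1|\theta_3 \cup \theta_4) + m_1(\theta_3|\theta_3 \cup \theta_4) + m_1(\theta_1 \cup \theta_3|\theta_3 \cup \theta_4) = 1 - \epsilon_1$$

But from the generic expression of $m_1(.|\theta_3 \cup \theta_4)$, one also knows that $m_1(\theta_1|\theta_3 \cup \theta_4) = 0$ and $m_1(\theta_1 \cup \theta_3|\theta_3 \cup \theta_4) = 0$. Thus the knowledge of $Bel_1(\bar{f}|p) = 1 - \epsilon_1$ implies

$$m_1(\theta_3|\theta_3 \cup \theta_4) = [m_1'(\theta_3) + m_1'(\theta_1 \cup \theta_3) + m_1'(\theta_2 \cup \theta_3) + m_1'(\theta_1 \cup \theta_2 \cup \theta_3)]/K_1 = 1 - \epsilon_1$$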


This is however not sufficient to fully define the values of all components of $m_1(.|\theta_3 \cup \theta_4)$ or, equivalently, of all components of $m_1'(.)$. To complete the derivation without extra unjustified specific information, one needs to apply the minimal commitment principle (MCP), which states that one should never give more support to the truth of a proposition than justified [8]. According to this principle, we commit a non null value only to the least specific proposition involved in the expression of $m_1(\theta_3|\theta_3 \cup \theta_4)$. In other words, the MCP allows us to choose legitimately

$$m_1'(\theta_1) = m_1'(\theta_2) = m_1'(\theta_3) = 0$$

$$m_1'(\theta_1 \cup \theta_2) = m_1'(\theta_1 \cup \theta_3) = m_1'(\theta_2 \cup \theta_3) = 0$$

$$m_1'(\theta_1 \cup \theta_2 \cup \theta_3) \neq 0$$

Thus $K_1 = 1$ and $m_1(\theta_3|\theta_3 \cup \theta_4)$ reduces to

$$m_1(\theta_3|\theta_3 \cup \theta_4) = m_1'(\theta_1 \cup \theta_2 \cup \theta_3) = 1 - \epsilon_1$$


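Since the sum of basic belief assignments must be one, one must also have for the remaining (uncommitted for now) masses of $m_1'(.)$ the constraint

$$m_1'(\theta_4) + m_1'(\theta_1 \cup \theta_4) + m_1'(\theta_2 \cup \theta_4) + m_1'(\theta_1 \cup \theta_2 \cup \theta_4) + m_1'(\theta_3 \cup \theta_4) + m_1'(\theta_1 \cup \theta_3 \cup \theta_4) + m_1'(\theta_2 \cup \theta_3 \cup \theta_4) + m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_1$$

By applying the MCP a second time, one chooses $m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_1$.

Finally, the complete and least specific belief assignment $m_1(.|p)$ compatible with the available prior information $Bel_1(\bar{f}|p) = 1 - \epsilon_1$ provided by the source $\mathcal{B}_1$ reduces to

$$m_1(\theta_3|\theta_3 \cup \theta_4) = m_1'(\theta_1 \cup \theta_2 \cup \theta_3) = 1 - \epsilon_1 \tag{12.18}$$

$$m_1(\theta_3 \cup \theta_4|\theta_3 \cup \theta_4) = m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_1 \tag{12.19}$$

or equivalently

$$m_1(\bar{f} \cap p|p) = m_1'(\bar{p} \cup \bar{f}) = 1 - \epsilon_1 \tag{12.20}$$

$$m_1(p|p) = m_1'(\bar{p} \cup \bar{f} \cup p \cup f) = \epsilon_1 \tag{12.21}$$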


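It is easy to check, from the mass $m_1(.|p)$, that one effectively gets $Bel_1(\bar{f}|p) = 1 - \epsilon_1$. Indeed:

$$\begin{aligned}
Bel_1(\bar{f}|p) &= Bel_1(\theta_1 \cup \theta_3|p) \\
&= Bel_1((\bar{f} \cap \bar{p}) \cup (\bar{f} \cap p)|p) \\
&= \underbrace{m_1(\bar{f} \cap \bar{p}|p)}_{0} + m_1(\bar{f} \cap p|p) + \underbrace{m_1((\bar{f} \cap \bar{p}) \cup (\bar{f} \cap p)|p)}_{0} \\
&= m_1(\bar{f} \cap p|p) \\
&= 1 - \epsilon_1
\end{aligned}$$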
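The construction of $m_1(.|p)$ can be replayed numerically. The sketch below is a minimal illustration, assuming that focal elements are encoded as Python `frozenset`s of string labels for $\theta_1, \ldots, \theta_4$; the helper names `dempster` and `bel`, the labels, and the value of $\epsilon_1$ are assumptions made for illustration only.

```python
from itertools import product


def dempster(m_a, m_b):
    """Dempster's rule for two mass functions given as {frozenset: mass}."""
    joint, conflict = {}, 0.0
    for (x, mx), (y, my) in product(m_a.items(), m_b.items()):
        z = x & y
        if z:
            joint[z] = joint.get(z, 0.0) + mx * my
        else:
            conflict += mx * my
    k = 1.0 - conflict  # normalization constant (K_1 in the text)
    return {z: v / k for z, v in joint.items()}, k


def bel(m, a):
    """Belief of a: total mass of the focal elements included in a."""
    return sum(v for x, v in m.items() if x <= a)


eps1 = 0.05  # hypothetical value of epsilon_1
t1, t2, t3, t4 = "nf&np", "f&np", "nf&p", "f&p"  # theta_1 .. theta_4

# MCP choice: mass 1 - eps1 on theta1 U theta2 U theta3, eps1 on the whole frame.
m_prime = {frozenset({t1, t2, t3}): 1 - eps1,
           frozenset({t1, t2, t3, t4}): eps1}
# Conditioning assignment: all the mass on p = theta3 U theta4.
m_second = {frozenset({t3, t4}): 1.0}

m1, k1 = dempster(m_prime, m_second)
print(k1, m1)
print(bel(m1, frozenset({t1, t3})))  # Bel_1(not-f | p)
```

With this choice no conflict arises ($K_1 = 1$), and the belief committed to $\theta_1 \cup \theta_3$ equals $1 - \epsilon_1$, as required by the constraint.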
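In a similar way, for the source $\mathcal{B}_2$ with $\Theta_2$ defined as

$$\Theta_2 = \{\theta_1 \triangleq \bar{f} \cap \bar{b},\ \theta_2 \triangleq \bar{b} \cap f,\ \theta_3 \triangleq f \cap b,\ \theta_4 \triangleq \bar{f} \cap b\}$$

schematically represented by the partition $\bar{b} = \theta_1 \cup \theta_2$, $b = \theta_3 \cup \theta_4$, $f = \theta_2 \cup \theta_3$, $\bar{f} = \theta_1 \cup \theta_4$, one looks for $m_2(.|b) = [m_2' \oplus m_2''](.)$ with $m_2''(b) = m_2''(\theta_3 \cup \theta_4) = 1$. From the MCP, the condition $Bel_2(f|b) = 1 - \epsilon_2$ and simple algebraic manipulations, one finally gets

$$m_2(\theta_3|\theta_3 \cup \theta_4) = m_2'(\theta_1 \cup \theta_2 \cup \theta_3) = 1 - \epsilon_2 \tag{12.22}$$

$$m_2(\theta_3 \cup \theta_4|\theta_3 \cup \theta_4) = m_2'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_2 \tag{12.23}$$

or equivalently

$$m_2(f \cap b|b) = m_2'(\bar{b} \cup f) = 1 - \epsilon_2 \tag{12.24}$$

$$m_2(b|b) = m_2'(\bar{b} \cup \bar{f} \cup b \cup f) = \epsilon_2 \tag{12.25}$$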
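In a similar way, for the source $\mathcal{B}_3$ with $\Theta_3$ defined as

$$\Theta_3 = \{\theta_1 \triangleq \bar{b} \cap \bar{p},\ \theta_2 \triangleq b \cap \bar{p},\ \theta_3 \triangleq p \cap b,\ \theta_4 \triangleq \bar{b} \cap p\}$$

schematically represented by the partition $\bar{p} = \theta_1 \cup \theta_2$, $p = \theta_3 \cup \theta_4$, $b = \theta_2 \cup \theta_3$, $\bar{b} = \theta_1 \cup \theta_4$, one looks for $m_3(.|p) = [m_3' \oplus m_3''](.)$ with $m_3''(p) = m_3''(\theta_3 \cup \theta_4) = 1$. From the MCP, the condition $Bel_3(b|p) = 1 - \epsilon_3$ and simple algebraic manipulations, one finally gets

$$m_3(\theta_3|\theta_3 \cup \theta_4) = m_3'(\theta_1 \cup \theta_2 \cup \theta_3) = 1 - \epsilon_3 \tag{12.26}$$

$$m_3(\theta_3 \cup \theta_4|\theta_3 \cup \theta_4) = m_3'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_3 \tag{12.27}$$

or equivalently

$$m_3(b \cap p|p) = m_3'(\bar{p} \cup b) = 1 - \epsilon_3 \tag{12.28}$$

$$m_3(p|p) = m_3'(\bar{b} \cup \bar{p} \cup b \cup p) = \epsilon_3 \tag{12.29}$$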


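Since all the complete prior basic belief assignments are available, one can combine them with Dempster's rule to summarize all our prior knowledge drawn from our simple rule-based expert system characterized by rules $R = \{r_1, r_2, r_3\}$ and convictions/confidences $W = \{w_1, w_2, w_3\}$ in these rules.

The fusion operation first requires choosing the following frame of discernment $\Theta$ (satisfying Shafer's model) given by

$$\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4, \theta_5, \theta_6, \theta_7, \theta_8\}$$

where

$$\begin{aligned}
\theta_1 &\triangleq f \cap b \cap p & \theta_5 &\triangleq \bar{f} \cap b \cap p \\
\theta_2 &\triangleq f \cap b \cap \bar{p} & \theta_6 &\triangleq \bar{f} \cap b \cap \bar{p} \\
\theta_3 &\triangleq f \cap \bar{b} \cap p & \theta_7 &\triangleq \bar{f} \cap \bar{b} \cap p \\
\theta_4 &\triangleq f \cap \bar{b} \cap \bar{p} & \theta_8 &\triangleq \bar{f} \cap \bar{b} \cap \bar{p}
\end{aligned}$$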
The fusion of masses $m_1(.)$ given by eqs. (12.20)-(12.21) with $m_2(.)$ given by eqs. (12.24)-(12.25) using Dempster's rule of combination yields $m_{12}(.) = [m_1 \oplus m_2](.)$ with the following non null components

$$m_{12}(f \cap b \cap p) = \epsilon_1(1 - \epsilon_2)/K_{12}$$

$$m_{12}(\bar{f} \cap b \cap p) = \epsilon_2(1 - \epsilon_1)/K_{12}$$

$$m_{12}(b \cap p) = \epsilon_1\epsilon_2/K_{12}$$

with $K_{12} \triangleq 1 - (1 - \epsilon_1)(1 - \epsilon_2) = \epsilon_1 + \epsilon_2 - \epsilon_1\epsilon_2$.

The fusion of all prior knowledge by Dempster's rule, $m_{123}(.) = [m_1 \oplus m_2 \oplus m_3](.) = [m_{12} \oplus m_3](.)$, yields the final result:

$$m_{123}(f \cap b \cap p) = m_{123}(\theta_1) = \epsilon_1(1 - \epsilon_2)/K_{123}$$

$$m_{123}(\bar{f} \cap b \cap p) = m_{123}(\theta_5) = \epsilon_2(1 - \epsilon_1)/K_{123}$$

$$m_{123}(b \cap p) = m_{123}(\theta_1 \cup \theta_5) = \epsilon_1\epsilon_2/K_{123}$$

with $K_{123} = K_{12} \triangleq 1 - (1 - \epsilon_1)(1 - \epsilon_2) = \epsilon_1 + \epsilon_2 - \epsilon_1\epsilon_2$, which actually and precisely defines the conditional belief assignment $m_{123}(.|p \cap b)$. It turns out that the fusion with the last basic belief assignment $m_3(.)$ brings no change with respect to the previous fusion result $m_{12}(.)$ in this particular problem.
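The two fusion steps can be reproduced with a few lines of code. This is a minimal sketch, assuming that each proposition is encoded as the set of truth assignments $(f, b, p)$ for which it holds, which gives the eight exclusive elements of $\Theta$; the helper names `event` and `dempster` and the numerical values of the $\epsilon_i$ are illustrative assumptions.

```python
from itertools import product

# The 8 exclusive elements of the refined frame: truth values of (f, b, p).
WORLDS = frozenset(product((True, False), repeat=3))


def event(pred):
    """Proposition = set of elementary worlds (f, b, p) where pred holds."""
    return frozenset(w for w in WORLDS if pred(*w))


def dempster(m_a, m_b):
    """Dempster's rule; returns the normalized masses and the constant K."""
    joint, conflict = {}, 0.0
    for (x, mx), (y, my) in product(m_a.items(), m_b.items()):
        z = x & y
        if z:
            joint[z] = joint.get(z, 0.0) + mx * my
        else:
            conflict += mx * my
    k = 1.0 - conflict
    return {z: v / k for z, v in joint.items()}, k


e1, e2, e3 = 0.05, 0.02, 0.01  # hypothetical values
m1 = {event(lambda f, b, p: p and not f): 1 - e1, event(lambda f, b, p: p): e1}
m2 = {event(lambda f, b, p: b and f): 1 - e2, event(lambda f, b, p: b): e2}
m3 = {event(lambda f, b, p: p and b): 1 - e3, event(lambda f, b, p: p): e3}

m12, k12 = dempster(m1, m2)
m123, _ = dempster(m12, m3)  # no additional conflict at this step

fbp = event(lambda f, b, p: f and b and p)        # theta_1
nfbp = event(lambda f, b, p: (not f) and b and p)  # theta_5
print(k12, m123[fbp], m123[nfbp], m123[fbp | nfbp])
```

In this sketch the second combination only intersects focal elements that are already included in $p \cap b$, which is why $m_3(.)$ leaves the masses of $m_{12}(.)$ unchanged.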


Since we are actually interested in assessing the belief that our observed particular penguin-animal named Tweety (denoted as $T = (p \cap b)$) can fly, we need to combine all our prior knowledge $m_{123}(.)$ drawn from our rule-based system with the belief assignment $m_o(T = (p \cap b)) = 1$ characterizing the observation about Tweety. Applying Dempster's rule again, one finally gets the resulting conditional basic belief function $m_{o123}(.) = [m_o \oplus m_{123}](.)$ defined by

$$m_{o123}(T = (f \cap b \cap p)|T = (p \cap b)) = \epsilon_1(1 - \epsilon_2)/K_{12}$$

$$m_{o123}(T = (\bar{f} \cap b \cap p)|T = (p \cap b)) = \epsilon_2(1 - \epsilon_1)/K_{12}$$

$$m_{o123}(T = (b \cap p)|T = (p \cap b)) = \epsilon_1\epsilon_2/K_{12}$$
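From the Dempster-Shafer reasoning, the belief and plausibility that Tweety can fly are given by

$$Bel(T = f|T = (p \cap b)) = \sum_{x \in 2^\Theta,\, x \subseteq f} m_{o123}(T = x|T = (p \cap b))$$

$$Pl(T = f|T = (p \cap b)) = \sum_{x \in 2^\Theta,\, x \cap f \neq \emptyset} m_{o123}(T = x|T = (p \cap b))$$

Because $f = (f \cap b \cap p) \cup (f \cap b \cap \bar{p}) \cup (f \cap \bar{b} \cap p) \cup (f \cap \bar{b} \cap \bar{p})$ and because of the specific values of the masses defining $m_{o123}(.)$, one has

$$Bel(T = f|T = (p \cap b)) = m_{o123}(T = (f \cap b \cap p)|T = (p \cap b))$$

$$Pl(T = f|T = (p \cap b)) = m_{o123}(T = (f \cap b \cap p)|T = (p \cap b)) + m_{o123}(T = (b \cap p)|T = (p \cap b))$$

and finally

$$Bel(T = f|T = (p \cap b)) = \frac{\epsilon_1(1 - \epsilon_2)}{K_{12}} \tag{12.30}$$

$$Pl(T = f|T = (p \cap b)) = \frac{\epsilon_1(1 - \epsilon_2)}{K_{12}} + \frac{\epsilon_1\epsilon_2}{K_{12}} = \frac{\epsilon_1}{K_{12}} \tag{12.31}$$

In a similar way, one will get for the belief and the plausibility that Tweety cannot fly

$$Bel(T = \bar{f}|T = (p \cap b)) = \frac{\epsilon_2(1 - \epsilon_1)}{K_{12}} \tag{12.32}$$

$$Pl(T = \bar{f}|T = (p \cap b)) = \frac{\epsilon_2(1 - \epsilon_1)}{K_{12}} + \frac{\epsilon_1\epsilon_2}{K_{12}} = \frac{\epsilon_2}{K_{12}} \tag{12.33}$$

Using the first order approximation when $\epsilon_1$ and $\epsilon_2$ are very small positive numbers, one finally gets

$$Bel(T = f|T = (p \cap b)) \approx Pl(T = f|T = (p \cap b)) \approx \frac{\epsilon_1}{\epsilon_1 + \epsilon_2}$$

In a similar way, one will get for the belief that Tweety cannot fly

$$Bel(T = \bar{f}|T = (p \cap b)) \approx Pl(T = \bar{f}|T = (p \cap b)) \approx \frac{\epsilon_2}{\epsilon_1 + \epsilon_2}$$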
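The quality of the first order approximation can be checked with a short numerical sketch. It assumes the closed forms $Bel = \epsilon_1(1-\epsilon_2)/K_{12}$ and $Pl = \epsilon_1/K_{12}$ with $K_{12} = \epsilon_1 + \epsilon_2 - \epsilon_1\epsilon_2$; the function name `dst_flying` and the test values are illustrative assumptions.

```python
def dst_flying(e1, e2):
    """Belief and plausibility that Tweety flies under Dempster's rule."""
    k12 = e1 + e2 - e1 * e2
    bel = e1 * (1 - e2) / k12
    pl = e1 / k12
    return bel, pl


# Hypothetical pairs (eps1, eps2) of small conviction defects.
for e1, e2 in [(1e-2, 1e-2), (1e-2, 1e-4), (1e-4, 1e-2)]:
    bel, pl = dst_flying(e1, e2)
    approx = e1 / (e1 + e2)  # first order approximation
    print(e1, e2, bel, pl, approx)
```

When $\epsilon_2$ is much smaller than $\epsilon_1$, both the belief and the plausibility of flying come close to one, which is the counter-intuitive behavior discussed next.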


This result coincides with Judea Pearl's result, but a different analysis and a detailed presentation have been given here. It turns out that this simple and complete analysis actually corresponds to the ballooning extension and the generalized Bayesian theorem proposed by Smets in [21] [24] and discussed by Shafer, although it was carried out independently of Smets' works. As pointed out by Judea Pearl, this result based on DST and Dempster's rule of combination looks very paradoxical/counter-intuitive, since it means that if nonflying birds are very rare, i.e. $\epsilon_2 \approx 0$, then penguin-birds like our observed penguin-bird Tweety have a very big chance of flying. As stated by Judea Pearl in [11], pages 448-449: "The clash with intuition revolves not around the exact numerical value of Bel(f) but rather around the unacceptable phenomenon that rule $r_3$, stating that penguins are a subclass of birds, plays no role in the analysis. Knowing that Tweety is both a penguin and a bird renders $Bel(T = f|T = (p \cap b))$ solely a function of $m_1(.)$ and $m_2(.)$, regardless of how penguins and birds are related. This stands contrary to common discourse, where people expect class properties to be overridden by properties of more specific subclasses. While in classical logic the three rules in our example would yield an unforgivable contradiction, the uncertainties attached to these rules, together with Dempster's normalization, now render them manageable. However, they are managed in the wrong way whenever we interpret if-then rules as randomized logical formulas of the material-implication type, instead of statements of conditional probabilities". Keep in mind, however, that Pearl's statement is given to show the semantic clash between the Dempster-Shafer reasoning and the fallacious Bayesian reasoning, in order to support the Bayesian reasoning approach.


12.5 The Dezert-Smarandache reasoning 


We analyze here the Tweety penguin triangle problem with the DSmT (see Part I of this book for a presentation of DSmT). The prior knowledge characterized by the rules $R = \{r_1, r_2, r_3\}$ and convictions $W = \{w_1, w_2, w_3\}$ is modeled as three independent sources of evidence defined on separate minimal and potentially paradoxical (i.e. internally conflicting) frames $\Theta_1 \triangleq \{p, \bar{f}\}$, $\Theta_2 \triangleq \{b, f\}$ and $\Theta_3 \triangleq \{p, b\}$, since the rule $r_1$ doesn't refer to the existence of $b$, the rule $r_2$ doesn't refer to the existence of $p$ and the rule $r_3$ doesn't refer to the existence of $f$ or $\bar{f}$. Let's note that the DSmT doesn't require the refinement of frames as with DST (see previous section). We follow the same analysis as in the previous section, but now based on our DSm reasoning and the DSm rule of combination.


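The first source $\mathcal{B}_1$, relative to $r_1$ with confidence $w_1 = 1 - \epsilon_1$, provides us the conditional belief $Bel_1(\bar{f}|p)$, which is now defined from a paradoxical basic belief assignment $m_1(.)$ resulting from the DSm combination of $m_1'(p) = 1$ with $m_1''(.)$ defined on the hyper-power set $D^{\Theta_1} = \{\emptyset, p, \bar{f}, p \cap \bar{f}, p \cup \bar{f}\}$. The choice for $m_1''(.)$ results directly from the derivation of the DSm rule and the application of the MCP. Indeed, the non null components of $m_1(.)$ are given by (we introduce explicitly the conditioning term in notation for convenience):

$$m_1(p|p) = m_1'(p)\,m_1''(p) + m_1'(p)\,m_1''(p \cup \bar{f})$$

$$m_1(p \cap \bar{f}|p) = m_1'(p)\,m_1''(\bar{f}) + m_1'(p)\,m_1''(p \cap \bar{f})$$

The information $Bel_1(\bar{f}|p) = 1 - \epsilon_1$ implies

$$Bel_1(\bar{f}|p) = m_1(\bar{f}|p) + m_1(p \cap \bar{f}|p) = 1 - \epsilon_1$$

Since $m_1(p|p) + m_1(p \cap \bar{f}|p) = 1$, one has necessarily $m_1(\bar{f}|p) = 0$ and thus, from the previous equation, $m_1(\bar{f} \cap p|p) = 1 - \epsilon_1$, which implies both

$$m_1(p|p) = \epsilon_1$$

$$m_1(p \cap \bar{f}|p) = \underbrace{m_1'(p)}_{1}\,m_1''(\bar{f}) + \underbrace{m_1'(p)}_{1}\,m_1''(p \cap \bar{f}) = m_1''(\bar{f}) + m_1''(p \cap \bar{f}) = 1 - \epsilon_1$$

Applying the MCP, it results that one must choose

$$m_1''(\bar{f}) = 1 - \epsilon_1 \quad \text{and} \quad m_1''(p \cap \bar{f}) = 0$$

The sum of the remaining masses of $m_1''(.)$ must then be equal to $\epsilon_1$, i.e.

$$m_1''(p) + m_1''(p \cup \bar{f}) = \epsilon_1$$

Applying again the MCP on this last constraint, one gets naturally

$$m_1''(p) = 0 \quad \text{and} \quad m_1''(p \cup \bar{f}) = \epsilon_1$$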


Finally the belief assignment $m_1(.|p)$ relative to the source $\mathcal{B}_1$ and compatible with the constraint $Bel_1(\bar{f}|p) = 1 - \epsilon_1$ holds the same numerical values as within the DST analysis (see eqs. (12.20)-(12.21)) and is given by

$$m_1(p \cap \bar{f}|p) = 1 - \epsilon_1$$

$$m_1(p|p) = \epsilon_1$$

but results here from the DSm combination of the two following assignments (i.e. $m_1(.) = [m_1' \oplus m_1''](.) = [m_1'' \oplus m_1'](.)$)

$$\begin{cases} m_1''(\bar{f}) = 1 - \epsilon_1 \ \text{and} \ m_1''(p \cup \bar{f}) = \epsilon_1 \\ m_1'(p) = 1 \end{cases} \tag{12.34}$$
In a similar manner, working on $\Theta_2 = \{b, f\}$ for source $\mathcal{B}_2$ with the condition $Bel_2(f|b) = 1 - \epsilon_2$, the mass $m_2(.|b)$ results from the internal DSm combination of the two following assignments

$$\begin{cases} m_2''(f) = 1 - \epsilon_2 \ \text{and} \ m_2''(b \cup f) = \epsilon_2 \\ m_2'(b) = 1 \end{cases} \tag{12.35}$$

Similarly, working on $\Theta_3 = \{p, b\}$ for source $\mathcal{B}_3$ with the condition $Bel_3(b|p) = 1 - \epsilon_3$, the mass $m_3(.|p)$ results from the internal DSm combination of the two following assignments

$$\begin{cases} m_3''(b) = 1 - \epsilon_3 \ \text{and} \ m_3''(b \cup p) = \epsilon_3 \\ m_3'(p) = 1 \end{cases} \tag{12.36}$$

It can be easily verified that these (less specific) basic belief assignments generate the conditions $Bel_1(\bar{f}|p) = 1 - \epsilon_1$, $Bel_2(f|b) = 1 - \epsilon_2$ and $Bel_3(b|p) = 1 - \epsilon_3$.


Now let's examine the result of the fusion of all these masses based on DSmT, i.e. by applying the DSm rule of combination to the following basic belief assignments

$$m_1(p \cap \bar{f}|p) = 1 - \epsilon_1 \quad \text{and} \quad m_1(p|p) = \epsilon_1$$

$$m_2(b \cap f|b) = 1 - \epsilon_2 \quad \text{and} \quad m_2(b|b) = \epsilon_2$$

$$m_3(p \cap b|p) = 1 - \epsilon_3 \quad \text{and} \quad m_3(p|p) = \epsilon_3$$

Note that these basic belief assignments turn out to be identical to those drawn from the DST framework analysis done in the previous section for this specific problem, because of the integrity constraint $f \cap \bar{f} = \emptyset$ and the MCP, but they actually result from a slightly different and simpler analysis drawn here from DSmT. So we attack the TP2 with the same information as in the analysis based on DST, but we will show that a coherent conclusion can be drawn with DSm reasoning.


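Let's emphasize now that one has to deal here with the hypotheses/elements $p$, $b$, $f$ and $\bar{f}$, and thus our global frame is given by $\Theta = \{b, p, f, \bar{f}\}$. Note that $\Theta$ doesn't satisfy Shafer's model since the elements of $\Theta$ are not all exclusive. This is a major difference between the foundations of DSmT and the foundations of DST. But because only $f$ and $\bar{f}$ are truly exclusive, i.e. $f \cap \bar{f} = \emptyset$, we are facing a quite simple hybrid DSm model $\mathcal{M}$, and thus the hybrid DSm fusion must apply rather than the classic DSm rule. We recall briefly here (a complete derivation, justification and examples can be found in Part I of this book) that the hybrid DSm rule of combination associated with a given hybrid DSm model for $k \geq 2$ independent sources of information is defined for all $A \in D^\Theta$ as:

$$m_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A)\,[S_1(A) + S_2(A) + S_3(A)] \tag{12.37}$$

where $\phi(A)$ is the characteristic non-emptiness function of the set $A$, i.e. $\phi(A) = 1$ if $A \notin \boldsymbol{\emptyset}$ ($\boldsymbol{\emptyset} \triangleq \{\emptyset_{\mathcal{M}}, \emptyset\}$ being the set of all relatively and absolutely empty elements) and $\phi(A) = 0$ otherwise, and

$$S_1(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \ \prod_{i=1}^{k} m_i(X_i) \tag{12.38}$$

$$S_2(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [\mathcal{U} = A] \vee [(\mathcal{U} \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \ \prod_{i=1}^{k} m_i(X_i) \tag{12.39}$$

$$S_3(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ (X_1 \cap X_2 \cap \ldots \cap X_k) \in \boldsymbol{\emptyset}}} \ \prod_{i=1}^{k} m_i(X_i) \tag{12.40}$$

with $\mathcal{U} \triangleq u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)$, where $u(X)$ is the union of all singletons $\theta_i$ that compose $X$, and $I_t \triangleq \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$ is the total ignorance defined on the frame $\Theta = \{\theta_1, \ldots, \theta_n\}$. For example, if $X$ is a singleton then $u(X) = X$; if $X = \theta_1 \cap \theta_2$ or $X = \theta_1 \cup \theta_2$ then $u(X) = \theta_1 \cup \theta_2$; if $X = (\theta_1 \cap \theta_2) \cup \theta_3$ then $u(X) = \theta_1 \cup \theta_2 \cup \theta_3$; by convention $u(\emptyset) \triangleq \emptyset$.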


The first sum $S_1(A)$ entering in the previous formula corresponds to the mass $m_{\mathcal{M}^f(\Theta)}(A)$ obtained by the classic DSm rule of combination based on the free DSm model $\mathcal{M}^f$ (i.e. on the free lattice $D^\Theta$). The second sum $S_2(A)$ entering in the formula of the hybrid DSm rule of combination (12.37) represents the mass of all relatively and absolutely empty sets, which is transferred to the total or relative ignorances. The third sum $S_3(A)$ entering in the formula of the hybrid DSm rule of combination (12.37) transfers the sum of relatively empty sets to the non-empty sets in the same way as it was calculated following the DSm classic rule.

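To apply the hybrid DSm fusion rule formula (12.37), it is important to note that $(p \cap \bar{f}) \cap (b \cap f) \cap p \equiv p \cap b \cap f \cap \bar{f} = \emptyset$ because $f \cap \bar{f} = \emptyset$; thus the mass $(1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3$ is transferred to the hybrid proposition $H_1 \triangleq (p \cap \bar{f}) \cup (b \cap f) \cup p \equiv (b \cap f) \cup p$. Similarly, $(p \cap \bar{f}) \cap (b \cap f) \cap (p \cap b) \equiv p \cap b \cap f \cap \bar{f} = \emptyset$ because $f \cap \bar{f} = \emptyset$, and therefore its associated mass $(1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3)$ is transferred to the hybrid proposition $H_2 \triangleq (p \cap \bar{f}) \cup (b \cap f) \cup (p \cap b)$. No other mass transfer is necessary for this Tweety Penguin Triangle Problem, and thus we finally get from the hybrid DSm fusion formula (12.37) the following result for $m_{123}(.|p \cap b) = [m_1 \oplus m_2 \oplus m_3](.)$ (where the $\oplus$ symbol corresponds here to the DSm fusion operator and we omit the conditioning term $p \cap b$ here due to space limitation):

$$m_{123}((b \cap f) \cup p\,|\,p \cap b) = (1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3$$

$$m_{123}((p \cap \bar{f}) \cup (b \cap f) \cup (p \cap b)\,|\,p \cap b) = (1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3)$$

$$m_{123}(p \cap b \cap \bar{f}\,|\,p \cap b) = (1 - \epsilon_1)\epsilon_2\epsilon_3 + (1 - \epsilon_1)\epsilon_2(1 - \epsilon_3) = (1 - \epsilon_1)\epsilon_2$$

$$m_{123}(p \cap b \cap f\,|\,p \cap b) = \epsilon_1(1 - \epsilon_2)\epsilon_3 + \epsilon_1(1 - \epsilon_2)(1 - \epsilon_3) = \epsilon_1(1 - \epsilon_2)$$

$$m_{123}(p \cap b\,|\,p \cap b) = \epsilon_1\epsilon_2\epsilon_3 + \epsilon_1\epsilon_2(1 - \epsilon_3) = \epsilon_1\epsilon_2$$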
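The mass transfers above can be reproduced mechanically. The following is a minimal sketch, not the authors' implementation: it assumes that every proposition of the hybrid model is encoded as a set of Venn-diagram regions of $\{b, p, f, \bar{f}\}$ (regions lying in both $f$ and $\bar{f}$ are removed to enforce $f \cap \bar{f} = \emptyset$), and it implements only the sums $S_1$ and $S_3$, since no focal element is itself empty in this problem. The names `REGIONS`, `atom`, `hybrid_dsm`, `bel` and the values of the $\epsilon_i$ are illustrative assumptions.

```python
from itertools import combinations, product

GENERATORS = ("b", "p", "f", "nf")  # "nf" stands for not-f
# Venn regions of the free model, minus those inside both f and not-f.
REGIONS = frozenset(
    frozenset(c)
    for r in range(1, len(GENERATORS) + 1)
    for c in combinations(GENERATORS, r)
    if not {"f", "nf"} <= set(c)
)


def atom(name):
    """Proposition for one generator: all Venn regions lying inside it."""
    return frozenset(reg for reg in REGIONS if name in reg)


def hybrid_dsm(*sources):
    """S1 + S3 part of the hybrid DSm rule for sources {proposition: mass}."""
    fused = {}
    for combo in product(*(s.items() for s in sources)):
        props = [prop for prop, _ in combo]
        mass = 1.0
        for _, m in combo:
            mass *= m
        target = frozenset.intersection(*props)  # S1: conjunctive consensus
        if not target:                            # S3: empty -> union
            target = frozenset.union(*props)
        fused[target] = fused.get(target, 0.0) + mass
    return fused


def bel(m, a):
    """Belief of a: total mass of the non-empty focal elements inside a."""
    return sum(v for x, v in m.items() if x and x <= a)


e1, e2, e3 = 0.05, 0.02, 0.01  # hypothetical values
b, p, f, nf = (atom(g) for g in GENERATORS)
m1 = {p & nf: 1 - e1, p: e1}
m2 = {b & f: 1 - e2, b: e2}
m3 = {p & b: 1 - e3, p: e3}

m123 = hybrid_dsm(m1, m2, m3)
h1 = (b & f) | p
h2 = (p & nf) | (b & f) | (p & b)
print(m123[h1], m123[h2], m123[p & b & nf], m123[p & b & f], m123[p & b])
print(sum(m123.values()), bel(m123, f), bel(m123, nf))
```

Encoding propositions as sets of regions lets ordinary set intersection and union stand in for the lattice operations of $D^\Theta$ under the constraint, which keeps the sketch short at the cost of not scaling to large frames.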


We can check that all these masses add up to 1 and that this result is fully coherent with rational intuition, especially when $\epsilon_3 = 0$, because the non null components of $m_{123}(.|p \cap b)$ then reduce to

$$m_{123}((p \cap \bar{f}) \cup (b \cap f) \cup (p \cap b)\,|\,p \cap b) = (1 - \epsilon_1)(1 - \epsilon_2)$$

$$m_{123}(p \cap b \cap \bar{f}\,|\,p \cap b) = (1 - \epsilon_1)\epsilon_2$$

$$m_{123}(p \cap b \cap f\,|\,p \cap b) = \epsilon_1(1 - \epsilon_2)$$

$$m_{123}(p \cap b\,|\,p \cap b) = \epsilon_1\epsilon_2$$

which means that from our DSm reasoning there is a strong uncertainty (due to the conflicting rules of our rule-based system), when $\epsilon_1$ and $\epsilon_2$ remain small positive numbers, that a penguin-bird animal is either a penguin-nonflying animal or a bird-flying animal. The small value $\epsilon_1\epsilon_2$ for $m_{123}(p \cap b|p \cap b)$ expresses adequately the fact that we cannot commit a strong basic belief assignment only to $p \cap b$ knowing $p \cap b$, just because one works on $\Theta = \{p, b, f, \bar{f}\}$ and we cannot consider the property $p \cap b$ solely, because the "birdness" or "penguinness" property necessarily endows either the flying or the non-flying property.


Therefore the belief about the particular observed penguin-bird animal Tweety (corresponding to the particular mass $m_o(T = (p \cap b)) = 1$) can be easily derived from the DSm fusion of all our prior knowledge summarized by $m_{123}(.|p \cap b)$ and the available observation summarized by $m_o(.)$, and we get

$$m_{o123}(T = (p \cap b \cap \bar{f})|T = (p \cap b)) = (1 - \epsilon_1)\epsilon_2$$

$$m_{o123}(T = (p \cap b \cap f)|T = (p \cap b)) = \epsilon_1(1 - \epsilon_2)$$

$$m_{o123}(T = (p \cap b)|T = (p \cap b)) = \epsilon_1\epsilon_2$$

$$m_{o123}(T = (b \cap f) \cup p\,|T = (p \cap b)) = (1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3$$

$$m_{o123}(T = (p \cap \bar{f}) \cup (b \cap f) \cup (p \cap b)|T = (p \cap b)) = (1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3)$$
From the DSm reasoning, the belief that Tweety can fly is then given by

$$Bel(T = f|T = (p \cap b)) = \sum_{x \in D^\Theta,\, x \subseteq f} m_{o123}(T = x|T = (p \cap b))$$

Using all the components of $m_{o123}(.|T = (p \cap b))$, one directly gets

$$Bel(T = f|T = (p \cap b)) = m_{o123}(T = (f \cap b \cap p)|T = (p \cap b))$$

and finally

$$Bel(T = f|T = (p \cap b)) = \epsilon_1(1 - \epsilon_2) \tag{12.41}$$

In a similar way, one will get for the belief that Tweety cannot fly

$$Bel(T = \bar{f}|T = (p \cap b)) = \epsilon_2(1 - \epsilon_1) \tag{12.42}$$


So now for both cases the beliefs remain very low, which is normal and coherent with the analysis done in section 12.3.2. Now let's examine the plausibilities of the ability for Tweety to fly or not to fly. These are given by

$$Pl(T = f|T = (p \cap b)) \triangleq \sum_{x \in D^\Theta,\, x \cap f \neq \emptyset} m_{o123}(T = x|T = (p \cap b))$$

$$Pl(T = \bar{f}|T = (p \cap b)) \triangleq \sum_{x \in D^\Theta,\, x \cap \bar{f} \neq \emptyset} m_{o123}(T = x|T = (p \cap b))$$

which turn out to be, after elementary algebraic manipulations,

$$Pl(T = f|T = (p \cap b)) = (1 - \epsilon_2) \tag{12.43}$$

$$Pl(T = \bar{f}|T = (p \cap b)) = (1 - \epsilon_1) \tag{12.44}$$


So we conclude, as reasonably/rationally expected, that we can't decide on the ability of Tweety to fly or not to fly, since one has

$$[Bel(f|p \cap b), Pl(f|p \cap b)] = [\epsilon_1(1 - \epsilon_2), (1 - \epsilon_2)] \approx [0, 1]$$

$$[Bel(\bar{f}|p \cap b), Pl(\bar{f}|p \cap b)] = [\epsilon_2(1 - \epsilon_1), (1 - \epsilon_1)] \approx [0, 1]$$

Note that when setting $\epsilon_1 = 0$ and $\epsilon_2 = 1$ (or $\epsilon_1 = 1$ and $\epsilon_2 = 0$), i.e. when one forces the full consistency of the initial rule-based system, one gets a coherent result on the certainty of the ability of Tweety to not fly (or to fly respectively).
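The two belief intervals and their limiting cases can be tabulated with a short sketch. It takes the closed forms of eqs. (12.41)-(12.44) as given; the function name `dsm_intervals` and the sample values are illustrative assumptions.

```python
def dsm_intervals(e1, e2):
    """[Bel, Pl] intervals for 'Tweety flies' and 'Tweety does not fly',
    using the closed forms of eqs. (12.41)-(12.44)."""
    fly = (e1 * (1 - e2), 1 - e2)
    not_fly = (e2 * (1 - e1), 1 - e1)
    return fly, not_fly


# Small conviction defects: both intervals are close to [0, 1].
print(dsm_intervals(0.01, 0.01))

# Fully consistent rule base (eps1 = 0, eps2 = 1): certainty of not flying.
print(dsm_intervals(0.0, 1.0))

# Symmetric case (eps1 = 1, eps2 = 0): certainty of flying.
print(dsm_intervals(1.0, 0.0))
```

For small $\epsilon_1$ and $\epsilon_2$ both intervals are nearly vacuous, whereas the two degenerate settings collapse one interval to $[1, 1]$ and the other to $[0, 0]$.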


This coherent result (radically different from the one based on Dempster-Shafer reasoning but starting with exactly the same available information) comes from the hybrid DSm fusion rule, which transfers some parts of the mass of the empty set $m(\emptyset) = (1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3 + (1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3) = (1 - \epsilon_1)(1 - \epsilon_2) \approx 1$ onto the propositions $(b \cap f) \cup p$ and $(p \cap \bar{f}) \cup (b \cap f) \cup (p \cap b)$.

It is clear however that the high value of $m(\emptyset)$ in this TP2 indicates a highly conflicting fusion problem, which shows that the TP2 is a true almost impossible problem, and the fusion result based on DSmT reasoning allows us to conclude on the true undecidability of the ability of Tweety to fly or not to fly. In other words, the fusion based on DSmT can be applied adequately to this almost impossible problem and concludes correctly on its undecidability. Another simplistic solution would consist in simply saying that the problem has to be considered as an impossible one just because $m(\emptyset) > 0.5$.


12.6 Conclusion 


In this chapter we have proposed a deep analysis of the challenging Tweety Penguin Triangle Problem. The analysis shows that Bayesian reasoning cannot be mathematically justified to characterize the problem because the probabilistic model doesn't hold, even when the principle of indifference and the conditional independence assumption are accepted. Any conclusions drawn from such a representation of the problem based on a hypothetical probabilistic model are actually based on a fallacious Bayesian reasoning. This is a fundamental result. We have then shown how the Dempster-Shafer reasoning manages, in what we feel is a wrong way, the uncertainty and the conflict in this problem. We then proved that the DSmT can deal properly with this problem and provides a well-founded and reasonable conclusion about the undecidability of its solution.
Chapter 13


Estimation of Target Behavior Tendencies using DSmT


Bulgarian Academy of Sciences, Sofia, Bulgaria
29 Av. de la Division Leclerc, 92320 Chatillon, France


Abstract: This chapter presents an approach for target behavior tendency estimation (Receding, Approaching). It is developed on the principles of Dezert-Smarandache theory (DSmT) of plausible and paradoxical reasoning applied to conventional sonar amplitude measurements, which serve as evidence for the corresponding decision-making procedures. In some real world situations it is difficult to finalize these procedures because of discrepancies in the interpretation of measurements. In these cases the decision-making process leads to conflicts, which cannot be resolved using the well-known methods. The aim of the performed study is to present and to demonstrate the ability of DSmT to finalize the decision-making process successfully and to assure awareness of the tendencies of target behavior in case of discrepancies in the interpretation of measurements. An example is provided to illustrate the benefit of applying the proposed approach in comparison with a fuzzy logic approach, and its ability to improve the overall tracking performance.


This chapter is based on a paper presented during the International Conference on Information Fusion, Fusion 2003, Cairns, Australia, in July 2003 and is reproduced here with permission of the International Society of Information Fusion. This work has been partially supported by MONT grants I-1205/02, I-1202/02 and by Center of Excellence BIS21 grant ICA1-2000-70016.

13.1 Introduction


Tracking systems based on sonars are a poorly developed topic due to a number of complications. These systems tend to be less precise than those based on active sensors, but one important advantage is their stealth. In the single sensor case only the direction of the target as an axis is known, but the true target position and behavior (approaching or receding) remain unknown. Recently, advances in computer technology have led to sophisticated data processing methods, which improve sonar capability. A number of the developed tracking techniques operating on angle-only measurement data use additional information. In our case we utilize the measured emitter's amplitude values at consecutive time moments. This information can be used to assess tendencies in the target's behavior and, consequently, to improve the overall angle-only tracking performance. The aim of the performed study is to present and to demonstrate the ability of DSmT to finalize the decision-making process successfully and to assure awareness of the tendencies of target behavior in case of discrepancies in the interpretation of angle-only measurements. Results are presented and compared with the respective results drawn from the fuzzy logic approach.


13.2 Statement of the Problem 


In order to track targets using angle-only measurements it is necessary to compensate for the unknown ranges by using additional information received from the emitter. In our case we suppose that, in parallel with the measured local angle, the observed target emits a constant signal, which is perceived by the sensor with a non-constant, varying strength (referred to as amplitude). The augmented measurement vector at the end of each time interval $k = 1, 2, \ldots$ is $Z = \{Z_\theta, Z_A\}$, where: $Z_\theta = \theta + \nu_\theta$ denotes the measured local angle with zero-mean Gaussian noise $\nu_\theta = \mathcal{N}(0, \sigma_{\nu_\theta})$ and covariance $\sigma_{\nu_\theta}$; $Z_A = A + \nu_A$ denotes the corresponding signal's amplitude value with zero-mean Gaussian noise $\nu_A = \mathcal{N}(0, \sigma_{\nu_A})$ and covariance $\sigma_{\nu_A}$. The variance of the amplitude value is due to the cluttered environment and the varying unknown distance to the object, which is conditioned by the possible different modes of target behavior (approaching or receding). Our goal is, utilizing the received amplitude feature measurement, to predict and to estimate the possible target behavior tendencies.
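The amplitude part of the measurement model, $Z_A = A + \nu_A$, can be simulated as follows. This is a minimal sketch under stated assumptions: the true amplitude is taken to be inversely proportional to the distance (`gain / d`, a hypothetical law chosen only for illustration), and the noise parameter `std_a` is treated as a standard deviation; neither the function name `simulate_amplitudes` nor the numerical values come from the original study.

```python
import random


def simulate_amplitudes(distances, gain=100.0, std_a=0.5, seed=0):
    """Noisy amplitude measurements Z_A(k) = A(k) + nu_A(k).

    Assumptions (illustrative only): A(k) = gain / distance(k), and nu_A is
    zero-mean Gaussian noise with standard deviation std_a.
    """
    rng = random.Random(seed)
    return [gain / d + rng.gauss(0.0, std_a) for d in distances]


# Hypothetical scenarios: a target closing in, then a target moving away.
approaching = simulate_amplitudes([50.0, 40.0, 30.0, 20.0, 10.0])
receding = simulate_amplitudes([10.0, 20.0, 30.0, 40.0, 50.0])
print(approaching)
print(receding)
```

Under this assumed law an approaching target produces a growing amplitude sequence and a receding target a decaying one, up to the measurement noise.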


Figure 13.1 represents a block diagram of the target's behavior tracking system. With regard to the formulated problem, we maintain two single-model-based Kalman-like filters running in parallel, using two models of possible target behavior: Approaching and Receding. At the initial time moment $k$ the target is characterized by the fuzzified amplitude state estimates according to the models $A^{App}(k|k)$ and $A^{Rec}(k|k)$. The new observation $Z_A(k + 1) = A(k + 1) + \nu_A(k + 1)$ is assumed to be the true value, corrupted by additive measurement noise. It is fuzzified according to the chosen fuzzification interface.

Figure 13.1: Block diagram of target's behavior tracking system


The tendency prediction approach is based on the Zadeh compositional rule. The updating procedure uses the Dezert-Smarandache classical combination rule based on the free DSm model to estimate target behavior states. Dezert-Smarandache Theory assures a particular framework where the frame of discernment is exhaustive but not necessarily exclusive, and it deals successfully with rational, uncertain or paradoxical data. In general this diagram resembles the commonly used approaches in standard tracking systems [B], but its peculiarity lies in the particular approaches implemented in the realization of the main steps.


13.3 Approach for Behavior Tendency Estimation 


There are a few particular basic components in the block diagram of target’s behavior tracking system. 


13.3.1 The fuzzification interface 


A decisive variable in our task is the amplitude value $A(k)$ transmitted from the emitter and received at
consecutive time moments $k = 1, 2, \ldots$. We use the fuzzification interface (fig. 13.2), which maps it into
two fuzzy sets defining two linguistic values in the frame of discernment
$\Theta = \{S \triangleq Small, B \triangleq Big\}$. Their membership functions are not arbitrarily chosen,
but rely on the inverse proportion dependency between the measured amplitude value and the corresponding
distance to the target.
Figure 13.2: Fuzzification Interface


The lengths of the fuzzy sets’ bases are design parameters that we calibrate for satisfactory performance.
These functions are tuned in conformity with the particular dependency $A \approx f(1/\delta D)$, known as a
priori information. The degree of overlap between adjacent fuzzy sets reflects the amplitude gradients at the
boundary points of the specified distance intervals.


13.3.2 The behavior model 


In conformity with our task, the definition of the fuzzy rules is consistent with tracking the tendency of
amplitude changes at consecutive time moments $k = 1, 2, \ldots$. A particular feature here is that
the considered fuzzy rules have one and the same antecedents and consequents. We define their meaning by
using the linguistic terms and associated membership functions specified in paragraph 13.3.1.
We consider two essential models of possible target behavior:


Approaching Target - its behavior is characterized as a stable process of gradually increasing amplitude
value, i.e. the transition $S \to S \to B \to B$ is held in a timely manner;


Receding Target - its behavior is characterized as a stable process of gradually decreasing amplitude
value, i.e. the transition $B \to B \to S \to S$ is held in a timely manner.


To represent these models appropriately, the following rule bases have to be carried out:


Behavior Model 1: Approaching Target:

Rule 1: IF A(k) = S THEN A(k+1) = S

Rule 2: IF A(k) = S THEN A(k+1) = B

Rule 3: IF A(k) = B THEN A(k+1) = B


Behavior Model 2: Receding Target:

Rule 1: IF A(k) = B THEN A(k+1) = B

Rule 2: IF A(k) = B THEN A(k+1) = S

Rule 3: IF A(k) = S THEN A(k+1) = S


The inference schemes for these particular fuzzy models are conditioned on the cornerstone principle
of each modeling process. It is proven [4] that the minimum and product inferences are the most widely
used in engineering applications, because they preserve cause and effect. The models are derived as fuzzy
graphs:

$$g = \max_i(\mu_{A_i \times B_i}(u, v)) = \max_i(\mu_{A_i}(u) \cdot \mu_{B_i}(v)) \qquad (13.1)$$

in which $\mu_{A_i \times B_i}(u, v) = \mu_{A_i}(u) \cdot \mu_{B_i}(v)$ corresponds to the Larsen product operator
for the fuzzy conjunction, $g = \max_i(\mu_{A_i \times B_i})$ is the maximum for the fuzzy union operator and

$$\mu_{B'}(y) = \max_{x_i}(\min(\mu_{A'}(x_i), \mu_{A \times B}(x_i, y_i)))$$

is the Zadeh max-min operator for the composition rule.


The fuzzy graphs related to the two models are obtained in conformity with the mathematical
interpretations described above, by using the specified membership functions for the linguistic terms Small
and Big, and taking into account, for completeness, all possible terms in the hyper-power set
$D^\Theta = \{S, B, S \cap B, S \cup B\}$:


Relation 1: Approaching Target


Relation 2: Receding Target
13.3.3 The amplitude state prediction


At the initial time moment $k$ the target is characterized by the fuzzified amplitude state estimates according
to the models, $\mu_{A^{App}}(k|k)$ and $\mu_{A^{Rec}}(k|k)$. Using these fuzzy sets and applying the Zadeh
max-min compositional rule [A] to relation 1 and relation 2, we obtain the models’ conditioned amplitude state
predictions for time $k+1$, i.e. $\mu_{A^{App}}(k+1|k)$ is given by
$\max(\min(\mu_{A^{App}}(k|k), \mu_{App}(k \to k+1)))$ and $\mu_{A^{Rec}}(k+1|k)$ by
$\max(\min(\mu_{A^{Rec}}(k|k), \mu_{Rec}(k \to k+1)))$.
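The prediction step can be illustrated with a minimal sketch of the max-min composition over the four elements of the hyper-power set. The relation matrix and the state estimate below are hypothetical values chosen only for illustration; they are not the entries of Relation 1 or Relation 2, and the function name is an assumption of this sketch.

```python
# Order of the hyper-power set elements used in this sketch.
ELEMENTS = ["S", "S_and_B", "B", "S_or_B"]


def max_min_composition(mu_now, relation):
    """Zadeh max-min composition: mu_next[j] = max_i min(mu_now[i], relation[i][j])."""
    n_cols = len(relation[0])
    return [
        max(min(mu_now[i], relation[i][j]) for i in range(len(mu_now)))
        for j in range(n_cols)
    ]


# HYPOTHETICAL fuzzy relation (rows: state at k, columns: state at k+1).
# Illustrative numbers only, not the chapter's Relation 1.
relation_app = [
    [1.0, 0.4, 0.6, 0.0],
    [0.2, 1.0, 0.5, 0.0],
    [0.0, 0.3, 1.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]

# HYPOTHETICAL fuzzified state estimate at time k, in the order of ELEMENTS.
mu_k = [0.7, 0.2, 0.1, 0.0]

mu_pred = max_min_composition(mu_k, relation_app)
for name, grade in zip(ELEMENTS, mu_pred):
    print(f"mu_pred({name}) = {grade}")
```

Because only `min` and `max` are applied, every predicted grade is one of the input grades, and the predicted vector is in general not normalized.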


13.3.4 State updating using DSmT 


The classical DSm combinational rule is used here for state updating. This procedure is realized on
the basis of fusion between the predicted states according to the considered models (Approaching, Receding)
and the new measurement. Since $D^\Theta$ is closed under the $\cup$ and $\cap$ operators, to guarantee that
$m(.) : D^\Theta \to [0,1]$ is a proper general information granule, it is necessary to transform the
fuzzy membership functions representing the predicted state and the new measurement into mass functions.
This is realized through their normalization with respect to the unity interval. The models’ conditioned
amplitude state prediction vector $\mu^{App/Rec}_{pred}(.)$ is obtained in the form:

$$[\mu^{A/R}_{pred}(S),\ \mu^{A/R}_{pred}(S \cap B),\ \mu^{A/R}_{pred}(B),\ \mu^{A/R}_{pred}(S \cup B)] \qquad (13.2)$$

In general the terms contained in $\mu^{App/Rec}_{pred}$ represent the possibilities that the predicted amplitude
behavior belongs to the elements of the hyper-power set $D^\Theta$, and there is no requirement for them to sum
up to unity. In order to use the classical DSm combinational rule, it is necessary to normalize
$\mu^{App/Rec}_{pred}$ to obtain the respective generalized basic belief assignments (gbba)
$\forall C \in D^\Theta = \{S, S \cap B, B, S \cup B\}$:

$$m^{App/Rec}_{pred}(C) = \frac{\mu^{App/Rec}_{pred}(C)}{\sum_{A \in D^\Theta} \mu^{App/Rec}_{pred}(A)} \qquad (13.3)$$

The equivalent normalization has to be made for the received new measurement before it is fused
with the DSm rule of combination.


Example


Let's consider at scan 3 the predicted vector for the model Approaching $\mu^{App/Rec}_{pred}(4|3)$ with
components $\mu(S) = 0.6$, $\mu(S \cap B) = 0.15$, $\mu(B) = 0.05$ and $\mu(S \cup B) = 0.0$. The normalization
constant is then $K = 0.6 + 0.15 + 0.05 + 0.0 = 0.8$ and after normalization one gets the resulting gbba

$$m^{App/Rec}_{pred}(S) = \frac{0.6}{K} = 0.75 \qquad m^{App/Rec}_{pred}(S \cap B) = \frac{0.15}{K} = 0.1875$$

$$m^{App/Rec}_{pred}(B) = \frac{0.05}{K} = 0.0625 \qquad m^{App/Rec}_{pred}(S \cup B) = \frac{0.0}{K} = 0.0$$

That way one can obtain $m^{App/Rec}_{pred}(.)$ as a general (normalized) information granule for the prediction
of the target’s behavior.
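The normalization of equation (13.3) applied to the numbers of the example can be sketched as follows. The dictionary keys and the function name are naming assumptions of this sketch, not notation from the chapter.

```python
def normalize_to_gbba(mu):
    """Divide every membership grade by the sum of all grades (equation 13.3)."""
    total = sum(mu.values())
    if total == 0:
        raise ValueError("all membership grades are zero; nothing to normalize")
    return {element: grade / total for element, grade in mu.items()}


# Predicted membership grades from the example at scan 3.
mu_pred = {"S": 0.6, "S_and_B": 0.15, "B": 0.05, "S_or_B": 0.0}

m_pred = normalize_to_gbba(mu_pred)
for element, mass in m_pred.items():
    print(f"m_pred({element}) = {mass:.4f}")
```

The same function can be applied to the fuzzified new measurement before fusion, since the text requires the equivalent normalization there.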


The target behavior estimate $m^{App/Rec}_{upd}(.)$ at measurement time is then obtained from
$m^{App/Rec}_{pred}(.)$ and the amplitude belief assignment $m_{mes}(B)$ (built from the normalization of the
new fuzzified crisp amplitude measurement received) by the DSm rule of combination, i.e.

$$m^{App/Rec}_{upd}(C) = [m^{App/Rec}_{pred} \oplus m_{mes}](C) = \sum_{A, B \in D^\Theta,\ A \cap B = C} m^{App/Rec}_{pred}(A)\, m_{mes}(B) \qquad (13.4)$$

Since, in contrast to DST, DSmT uses a frame of discernment which is exhaustive but in the general
case not exclusive (as in our case for $\Theta = \{S, B\}$), we are able to take into account and to utilize
the paradoxical information $S \cap B$, although it is not precisely defined. This information relates to the
case when the moving target resides in an overlapping intermediate region, where it is hard to predict
the tendency in its behavior properly. The conflict management modeled in this way contributes to
a better understanding of the target motion and assures awareness of the behavior tendencies in
such cases.
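A minimal sketch of the classical DSm rule (13.4) on the free model for a two-element frame is given below. Each element of the hyper-power set is encoded by the parts of the Venn diagram it covers, which is an implementation choice of this sketch. The predicted gbba reuses the numbers of the example above, while the measurement gbba is a hypothetical value.

```python
from itertools import product

# Each element of D^Theta is encoded by the Venn-diagram parts it covers:
# "s" = S only, "sb" = overlap of S and B, "b" = B only.
VENN = {
    "S": frozenset({"s", "sb"}),
    "B": frozenset({"b", "sb"}),
    "S_and_B": frozenset({"sb"}),
    "S_or_B": frozenset({"s", "sb", "b"}),
}
NAME_OF = {parts: name for name, parts in VENN.items()}


def dsm_classic(m1, m2):
    """Classical DSm rule on the free model: m(C) = sum of m1(A) * m2(B) over A & B == C."""
    fused = {name: 0.0 for name in VENN}
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        c = NAME_OF[VENN[a] & VENN[b]]  # the intersection is never empty in the free model
        fused[c] += mass_a * mass_b
    return fused


m_pred = {"S": 0.75, "S_and_B": 0.1875, "B": 0.0625, "S_or_B": 0.0}
m_mes = {"S": 0.2, "S_and_B": 0.1, "B": 0.7, "S_or_B": 0.0}  # HYPOTHETICAL measurement gbba

m_upd = dsm_classic(m_pred, m_mes)
for element, mass in m_upd.items():
    print(f"m_upd({element}) = {mass:.5f}")
```

In this illustration the prediction favors Small while the measurement favors Big, so most of the fused mass is transferred to the paradoxical element rather than being discarded as conflict.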


13.4 The decision criterion 


It is possible to build for each model M = (A)pproaching, (R)eceding a subjective probability measure
$P^M_{upd}(.)$ from the bba $m^M_{upd}(.)$ with the generalized pignistic transformation (GPT) [3][6], defined
$\forall A \in D^\Theta$ by

$$P^M_{upd}\{A\} = \sum_{C \in D^\Theta | A \cap C \neq \emptyset} \frac{\mathcal{C}_{\mathcal{M}^f}(C \cap A)}{\mathcal{C}_{\mathcal{M}^f}(C)}\, m^M_{upd}(C) \qquad (13.5)$$

where $\mathcal{C}_{\mathcal{M}^f}(X)$ denotes the DSm cardinal of proposition $X$ for the free DSm model
$\mathcal{M}^f$ of the problem under consideration here. The decision criterion for the estimation of the correct
model M is then based on the evolution of the pignistic entropies associated with the updated amplitude states:

$$H^M_{pig}(P^M_{upd}) \triangleq -\sum_{A \in \mathcal{V}} P^M_{upd}\{A\} \ln(P^M_{upd}\{A\}) \qquad (13.6)$$

where $\mathcal{V}$ denotes the parts of the Venn diagram of the free DSm model $\mathcal{M}^f$. The estimation
$\hat{M}(k)$ of the correct model at time $k$ is given by the most informative model, corresponding to the
smallest value of the pignistic entropy between $H^A_{pig}(P^A_{upd})$ and $H^R_{pig}(P^R_{upd})$.
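The decision criterion can be sketched for the two-element free model as follows. The sketch evaluates the GPT only on the three parts of the Venn diagram, where the DSm cardinal of an element is the number of parts it covers. The two updated gbba are hypothetical values used only to show the comparison, and the names are assumptions of this sketch.

```python
import math

# Venn-diagram parts covered by each element of D^Theta for Theta = {S, B}.
VENN = {
    "S": frozenset({"s", "sb"}),
    "B": frozenset({"b", "sb"}),
    "S_and_B": frozenset({"sb"}),
    "S_or_B": frozenset({"s", "sb", "b"}),
}
PARTS = ("s", "sb", "b")


def pignistic(m):
    """GPT evaluated on each Venn part A: P{A} = sum over C containing A of m(C) / card(C)."""
    return {
        part: sum(mass / len(VENN[c]) for c, mass in m.items() if part in VENN[c])
        for part in PARTS
    }


def pignistic_entropy(p):
    """H = -sum of P{A} ln P{A}, with the convention 0 * ln 0 = 0."""
    return -sum(q * math.log(q) for q in p.values() if q > 0.0)


# HYPOTHETICAL updated gbba for the two models.
m_upd = {
    "Approaching": {"S": 0.15, "S_and_B": 0.80625, "B": 0.04375, "S_or_B": 0.0},
    "Receding": {"S": 0.05, "S_and_B": 0.05, "B": 0.9, "S_or_B": 0.0},
}

entropies = {model: pignistic_entropy(pignistic(m)) for model, m in m_upd.items()}
chosen = min(entropies, key=entropies.get)  # smallest entropy = most informative model
print(entropies, chosen)
```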


13.5 Simulation study 


A non-real-time simulation scenario is developed for a single target trajectory (fig. 13.3) in plane
coordinates X, Y and for constant velocity movement. The tracker is located at position (0 km, 0 km). The
target's starting point is $(x_0 = 5$ km, $y_0 = 10$ km$)$, with the following velocities during the
two parts of the trajectory: $(\dot{x} = 100$ m/s, $\dot{y} = 100$ m/s$)$ and
$(\dot{x} = -100$ m/s, $\dot{y} = -100$ m/s$)$.


Figure 13.3: Target trajectory.


Figure 13.4: Measurements statistics.


The time sampling rate is $T = 10$ s. The dynamics of the target movement are modeled by the equations:

$$x(k) = x(k-1) + \dot{x}T \quad \text{and} \quad y(k) = y(k-1) + \dot{y}T$$

The amplitude value $Z_A(k) = A(k) + \nu_A(k)$ measured by sonar is a random Gaussian distributed process
with mean $A(k) = 1/D(k)$ and covariance $\sigma_A(k)$ (fig. 13.4). $D(k) = \sqrt{x^2(k) + y^2(k)}$ is the
distance to the target, $(x(k), y(k))$ is the corresponding vector of coordinates, and $\nu_A(k)$ is the
measurement noise. Each amplitude value (the true one and the corresponding noisy one) received at each scan
is processed according to the block diagram (figure 13.1).
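The scenario can be reproduced approximately with the short sketch below. The sampling period, the starting point and the speed come from the text. The scan at which the velocity changes sign and the noise level are assumptions made only for this illustration, since their values are not stated in the text.

```python
import math
import random

T = 10.0                     # sampling period in seconds (from the text)
X0, Y0 = 5_000.0, 10_000.0   # starting point in metres (from the text)
SPEED = 100.0                # |x_dot| = |y_dot| in m/s (from the text)
N_SCANS = 100                # number of scans, as on the scan axis of the figures
SWITCH_SCAN = 50             # ASSUMPTION: last scan of the first part of the trajectory
SIGMA_REL = 0.05             # ASSUMPTION: noise standard deviation as a fraction of A(k)


def simulate(seed=0):
    """Return the true and the noisy amplitude sequences A(k) and Z_A(k)."""
    rng = random.Random(seed)
    x, y = X0, Y0
    true_amp, noisy_amp = [], []
    for k in range(1, N_SCANS + 1):
        sign = 1.0 if k <= SWITCH_SCAN else -1.0
        x += sign * SPEED * T          # x(k) = x(k-1) + x_dot * T
        y += sign * SPEED * T          # y(k) = y(k-1) + y_dot * T
        distance = math.hypot(x, y)    # D(k)
        amplitude = 1.0 / distance     # A(k) = 1 / D(k)
        true_amp.append(amplitude)
        noisy_amp.append(amplitude + rng.gauss(0.0, SIGMA_REL * amplitude))
    return true_amp, noisy_amp


true_amp, noisy_amp = simulate()
print(true_amp[0], true_amp[SWITCH_SCAN - 1], true_amp[-1])
```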
Figure 13.5: Behavior tendencies (Noise-free measurements).


Figure 13.6: Behavior tendencies (Noisy measurements).


Figures 13.5 and 13.6 show the results obtained during the whole motion of the observed target.
Figure 13.5 represents the case when the measurements are without noise, i.e. $Z(k) = A(k)$. Figure 13.6
represents the case when the measured amplitude values are corrupted by noise. In general the presented
graphics show the estimated tendencies in target behavior, which are described via the scan-to-scan
transitions of the estimated amplitude states.


Figure 13.7 represents the evolution of the pignistic entropies associated with the updated amplitude states
for the Approaching and Receding models in the case of noisy measurements; the figure for the noise-free
measurements is similar. It illustrates the decision criterion used to choose the correct model. Looking at
figure 13.5 and figure 13.7, it can be seen that between the 1st and 15th scans the target motion
is supported by the Approaching model, because that mode corresponds to the minimum entropy values,
which means that it is the more informative one.


[Plot, noisy case: variation of pignistic entropy for updated states versus scans, for Model Approaching and Model Receding]

Figure 13.7: Evolution of the pignistic entropy for updated states.


The Approaching model is dominant, because the measured amplitude values during these scans stably
reside in the state Big, as is evident from the fuzzification interface (fig. 13.2). At the same time, the
Receding model supports the overlapping region $S \cap B$, which is a transition towards the state Small.
Between scans 16th and 90th the Receding model becomes dominant, since the variations of the amplitude changes
are minimal and the amplitude values stably support the state Small. During these scans the Approaching
model has a small reaction to the measurement statistics, keeping the paradoxical state $S \cap B$. It is
interesting and important to note that between scans 16th and 30th the difference of entropies between the
Approaching and Receding models increases, which makes us increasingly sure that the Receding mode is
becoming dominant. Then, between scans 75th and 90th, the difference of these entropies decreases, which
means that we are less and less sure that the Receding model still remains dominant.
After the switching scan 91st the Approaching model becomes the dominant one, until scan 100th. In general the
reaction of the considered models to the changes of target motion is not immediate, because the whole
behavior estimation procedure deals with the vague propositions Small and Big, and sequences of amplitude
values at consecutive scans often reside stably in one and the same state.


Comparing the results in figure 13.6 with the results in figure 13.5, it is evident that, despite some
disorder in the estimated behavior tendencies, one can make an approximately correct decision thanks to the
ability of DSmT to deal with conflicts, which contributes to a better understanding of target
behavior and evaluation of the threat.


13.6 Comparison between DSm and Fuzzy Logic Approaches


The objective of this section is to compare the results obtained by using DSm theory with the respective
results drawn from the Fuzzy Logic Approach (FLA) [8] [9], applied to the same simulation scenario.
The main differences between the two approaches lie in the domain of the considered working
propositions and in the updating procedure. In the present work, we use the DSm combination rule to
fuse the predicted state and the new measurement to obtain the estimated behavior states, while in the
fuzzy approach the state estimates are obtained through a fuzzy set intersection between these entities. It
is evident from the results shown in figures 13.8 and 13.9 that here we deal with only two propositions,
$\Theta = \{Small, Big\}$. There is no way to examine the behavior tendencies in the overlapping region,
taking into consideration each of the possible target movements: from $S \cap B$ to $B$ or from $S \cap B$ to $S$.
Figure 13.8: Behavior Tendencies drawn from FLA (Noise-free Measurements).


Figure 13.9: Behavior Tendencies without Noise Reduction drawn from FLA (Noisy Case).


Figure 13.8 shows the noise-free measurement case. It can be seen that between scans 10 and 90 the target
motion is supported by the Receding model, which is the correct one for that case, while the Approaching model
has no reaction at all. If we compare the corresponding figure 13.5 (DSm case) with the present figure 13.8,
we can see that in the case of the DSm approach the Receding model reacts more adequately to the true target
tendency, because it is possible to deal with the real situation: the tendency of the target to move
from $B$ to the overlapping region $B \cap S$. In the FLA case there is no such opportunity, and because of
that between the 1st and 10th scans the Receding model has no reaction to the real target movement towards
$B \cap S$. Figure 13.9 represents the case when the measured amplitude values are corrupted by noise. It
is difficult to make a proper decision about the behavior tendency, especially after scan 90th, because
here the Approaching model coincides with the Receding model. In order to reduce the
influence of measurement noise on the tendency estimation, an additional noise reduction procedure has
to be applied to make the measurements more informative. Its application improves the overall process
of behavior estimation. Bearing in mind all the results drawn from the DSmT and FLA applications, we can
make the following considerations:


• DSmT and FLA deal with a frame of discernment based in general on imprecise/vague notions
and concepts $\Theta = \{S, B\}$. But DSmT also allows us to deal with uncertain and/or paradoxical
data, operating on the hyper-power set $D^\Theta = \{S, S \cap B, B, S \cup B\}$. In our particular application
it gives us an opportunity for flexible tracking of the changes of possible target behavior in the
overlapping region $S \cap B$.


• DSmT based behavior estimates can be characterized as noise resistant, while FLA uses an
additional noise reduction procedure to produce "smoothed" behavior estimates.


13.7 Conclusions 


An approach for estimating the tendency of target behavior was proposed. It is based on Dezert-
Smarandache theory applied to conventional sonar measurements. It was evaluated using computer
simulation. The provided example illustrates the benefits of the DSm approach in comparison with the fuzzy
logic one. By dealing simultaneously with uncertain and paradoxical data, an opportunity for flexible and
robust reasoning is realized, overcoming the described limitations of the fuzzy logic approach.
The ability of DSmT to ensure a reasonable and successful decision-making procedure about the tendencies
of target behavior, in the case of discrepancies in the interpretation of angle-only measurements, is
presented and confirmed. The proposed approach yields a confident picture for complex and ill-defined
engineering problems.
Chapter 14


Generalized Data Association for 


Multitarget Tracking in Clutter 


A. Tchamova, T. Semerdjiev, P. Konstantinova
Institute for Parallel Processing
Bulgarian Academy of Sciences

J. Dezert
ONERA
29 Av. de la Division Leclerc
Abstract: The objective of this chapter is to present an approach for target tracking
in a cluttered environment, which incorporates the advanced concept of generalized
data (kinematics and attribute) association (GDA) to improve track maintenance
performance in complicated situations (closely spaced and/or crossing targets), when
kinematics data are insufficient for correct decision making. It uses a Global Nearest
Neighbour-like approach and the Munkres algorithm to resolve the generalized
association matrix. The main peculiarity consists in applying the principles of Dezert-
Smarandache theory (DSmT) of plausible and paradoxical reasoning to model and
process the utilized attribute data. The new general Dezert-Smarandache hybrid rule
of combination is used to deal with particular integrity constraints associated with
some elements of the free distributive lattice. The aim of the performed study is to
provide a coherent decision-making process related to generalized data association and
to improve the overall tracking performance. A comparison with the corresponding
results obtained via Dempster-Shafer theory is made.


14.1 Introduction


One important function of each radar surveillance system in a cluttered environment is to keep and
improve the targets’ track maintenance performance. This becomes a crucial and challenging problem,
especially in complicated situations of closely spaced and/or crossing targets. The design of modern
multitarget tracking (MTT) algorithms in such a real-life stressful environment motivates the incorporation
of advanced concepts for generalized data association. In order to resolve correlation ambiguities
and to select the best observation-track pairings, in this study a particular generalized data association
(GDA) approach is proposed and incorporated in an MTT algorithm. It allows the introduction of target
attributes into the association logic, based on the general Dezert-Smarandache rule of combination, which
is adapted to deal with possible integrity constraints on the problem under consideration due to the true
nature of the elements involved in it. This chapter extends recent research work published in [15], which
was limited to target tracking in a clutter-free environment.


14.2 Basic Elements of Tracking Process 


The tracking process consists of two basic elements: data association and track filtering. The first element
is often considered the most important. Its goal is to associate observations with existing tracks.


14.2.1 Data Association


To eliminate unlikely observation-to-track pairings, a validation region (gate) is first formed
around the predicted track position. The measurements in the gate are candidates for association with the
corresponding track.


14.2.1.1 Gating 


We assume zero-mean Gaussian white noise for measurements. The vector difference between the received
measurement vector $z_j(k)$ and the predicted measurement vector $\hat{z}_i(k|k-1)$ of target $i$ is defined
to be the residual vector (called innovation)

$$\tilde{z}_{ij}(k) = z_j(k) - \hat{z}_i(k|k-1)$$

with residual covariance matrix $S = HPH' + R$, where $P$ is the state prediction covariance matrix, $H$
is the measurement matrix and $R$ is the measurement covariance matrix [2] [3] [4] [5]. The scan indexes $k$
will be dropped for notational convenience. The norm (normalized distance function) of the innovation
is evaluated as:

$$d_{ij}^2 = \tilde{z}_{ij}' S^{-1} \tilde{z}_{ij}$$

One defines a threshold constant for the gate $\gamma$ such that correlation is allowed if the following
relationship is satisfied

$$d_{ij}^2 \leq \gamma \qquad (14.1)$$

Assume that the measurement vector size is $M$. The quantity $d_{ij}^2$ is the sum of the squares of $M$
independent Gaussian random variables with zero means and unit standard deviations. For that reason
$d_{ij}^2$ will have a $\chi^2_M$ distribution with $M$ degrees of freedom and an allowable probability of a valid
observation falling outside the gate. The threshold constant $\gamma$ can be defined from the table of the
chi-square ($\chi^2_M$) distribution [3].
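The gating test (14.1) can be sketched for a two-dimensional measurement vector ($M = 2$). For two degrees of freedom the chi-square quantile has the closed form $\gamma = -2\ln(1 - P_G)$, which is used here instead of a table lookup. The gate probability, the covariance and the innovations are hypothetical values, and the function names are assumptions of this sketch.

```python
import math


def gate_threshold_2d(p_gate):
    """Chi-square quantile for M = 2 degrees of freedom (closed form)."""
    return -2.0 * math.log(1.0 - p_gate)


def normalized_distance_2d(innovation, s):
    """d^2 = z' S^-1 z for a 2-vector z and a 2x2 covariance matrix S."""
    (a, b), (c, d) = s
    det = a * d - b * c
    if det <= 0.0:
        raise ValueError("S must be positive definite")
    z1, z2 = innovation
    # S^-1 = (1 / det) * [[d, -b], [-c, a]]
    return (z1 * (d * z1 - b * z2) + z2 * (-c * z1 + a * z2)) / det


gamma = gate_threshold_2d(0.99)          # HYPOTHETICAL gate probability of 0.99
S = [[4.0, 0.0], [0.0, 1.0]]             # HYPOTHETICAL residual covariance

d2_near = normalized_distance_2d((2.0, 1.0), S)
d2_far = normalized_distance_2d((8.0, 3.0), S)
print(gamma, d2_near <= gamma, d2_far <= gamma)
```

Here the first innovation falls inside the gate and remains a candidate for association, while the second one is rejected.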


14.2.1.2 Generalized Data Association (GDA) 


If a single observation is within a gate and if that observation is not within a gate of any other track, the 
observation can be associated with this track and used to update the track filter. But in a dense target 
environment additional logic is required when an observation falls within the gates of multiple target 


tracks or when multiple observations fall within the gate of a target track. 


When attribute data are available, the generalized probability can be used to improve the assignment.
In view of the independence of the kinematic and attribute measurement errors, the generalized probability
for measurement $j$ originating from track $i$ is:

$$P_{gen}(i, j) = P_k(i, j) P_a(i, j)$$

where $P_k(i, j)$ and $P_a(i, j)$ are the kinematic and attribute probability terms respectively.


Our goal is to choose a set of assignments $\{\chi_{ij}\}$, for $i = 1, \ldots, n$ and $j = 1, \ldots, m$, that
assures the maximum of the total generalized probability sum. To find it, we use the solution of the
assignment problem

$$\min \sum_{i=1}^{n} \sum_{j=1}^{m} a_{ij} \chi_{ij}$$

where:

$$\chi_{ij} = \begin{cases} 1 & \text{if measurement } j \text{ is assigned to track } i \text{ according to the assignment problem solution} \\ 0 & \text{otherwise} \end{cases}$$

If, in the attempt to maximize the number of assignments, the assignment algorithm chooses a pairing
that does not satisfy the gate, the assignment is later removed.


Because our probabilities vary in the range $0 \leq P_k(i, j), P_a(i, j) \leq 1$, and to satisfy the condition
to be minimized, the elements of the particular assignment matrix are defined as:

$$a_{ij} = 1 - P_{gen}(i, j) = 1 - P_k(i, j) P_a(i, j)$$
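The construction of the assignment matrix and its minimization can be sketched as follows. For brevity the sketch enumerates all pairings by brute force instead of using the Munkres algorithm named in the abstract; this gives the same optimum for small problems but scales factorially. The probability values are hypothetical, and the function name is an assumption of this sketch.

```python
from itertools import permutations


def best_assignment(p_kin, p_att):
    """Minimize the sum of a_ij = 1 - P_k(i,j) * P_a(i,j) over one-to-one pairings.

    Rows are tracks, columns are measurements; requires len(rows) <= len(columns).
    Brute-force enumeration is used here in place of the Munkres algorithm.
    """
    n = len(p_kin)       # number of tracks
    m = len(p_kin[0])    # number of measurements
    cost = [[1.0 - p_kin[i][j] * p_att[i][j] for j in range(m)] for i in range(n)]
    best_cost, best_pairs = float("inf"), None
    for cols in permutations(range(m), n):
        total = sum(cost[i][cols[i]] for i in range(n))
        if total < best_cost:
            best_cost, best_pairs = total, list(enumerate(cols))
    return best_pairs, best_cost


# HYPOTHETICAL case: kinematics cannot separate the two targets, attributes can.
p_kin = [[0.5, 0.5], [0.5, 0.5]]
p_att = [[0.9, 0.2], [0.1, 0.8]]

pairs, total_cost = best_assignment(p_kin, p_att)
print(pairs, total_cost)
```

With equal kinematic probabilities the pairing is decided entirely by the attribute term, which is the situation the generalized data association is meant to handle.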
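14.2.2 Filtering


The tracking filter used is the first-order extended Kalman filter [7] for the target state vector
$\mathbf{x} = [x\ \dot{x}\ y\ \dot{y}]'$, where $x$ and $y$ are Cartesian coordinates and $\dot{x}$ and $\dot{y}$
are velocities along the Cartesian axes, and the measurement vector $\mathbf{z} = [\beta\ D]'$, where $\beta$ is
the azimuth (measured from the North) and $D$ is the distance from the observer to the target under
consideration.


The measurement function $h(.)$ is (assuming the sensor located at position (0,0)):

$$h(\mathbf{x}) = [h_1(\mathbf{x})\ h_2(\mathbf{x})]' = \left[\arctan\left(\frac{x}{y}\right)\ \ \sqrt{x^2 + y^2}\right]'$$

and the Jacobian [8]:

$$H = \begin{bmatrix} \dfrac{y}{x^2 + y^2} & 0 & \dfrac{-x}{x^2 + y^2} & 0 \\ \dfrac{x}{\sqrt{x^2 + y^2}} & 0 & \dfrac{y}{\sqrt{x^2 + y^2}} & 0 \end{bmatrix}$$

We assume a constant velocity target model. The process noise covariance matrix is $Q = \sigma_\nu^2 Q_T$,
where $T$ is the sampling/scanning period, $\sigma_\nu$ is the standard deviation of the process noise and
$Q_T$ is given by [8]:

$$Q_T = diag(Q_{2 \times 2}, Q_{2 \times 2}) \quad \text{with} \quad Q_{2 \times 2} = \begin{bmatrix} \dfrac{T^4}{4} & \dfrac{T^3}{2} \\ \dfrac{T^3}{2} & T^2 \end{bmatrix}$$

The measurement error matrix is $R = diag(\sigma_\beta^2, \sigma_D^2)$, where $\sigma_\beta$ and $\sigma_D$ are
the standard deviations of the measurement errors for azimuth and distance.


The track initiation is performed by two-point differencing [7]. After receiving observations for the first
two scans, the initial state vector is estimated by
$\hat{\mathbf{x}} = \left[x(2)\ \ \frac{x(2) - x(1)}{T}\ \ y(2)\ \ \frac{y(2) - y(1)}{T}\right]'$, where
$(x(1), y(1))$ and $(x(2), y(2))$ are respectively the target positions at the first scan for time stamp
$k = 1$, and at the second scan for $k = 2$. The initial (starting at time stamp $k = 2$) state covariance
matrix $P$ is evaluated by:

$$P = diag(P^x_{2 \times 2}, P^y_{2 \times 2}) \quad \text{with} \quad P^{(.)}_{2 \times 2} = \begin{bmatrix} \sigma_{(.)}^2 & \dfrac{\sigma_{(.)}^2}{T} \\ \dfrac{\sigma_{(.)}^2}{T} & \dfrac{2\sigma_{(.)}^2}{T^2} \end{bmatrix}$$

where the index (.) must be replaced by either the $x$ or the $y$ index, with
$\sigma_x^2 \approx \sigma_D^2 \sin^2(z_\beta) + z_D^2 \sigma_\beta^2 \cos^2(z_\beta)$ and
$\sigma_y^2 \approx \sigma_D^2 \cos^2(z_\beta) + z_D^2 \sigma_\beta^2 \sin^2(z_\beta)$. $z_\beta$ and $z_D$ are
the components of the measurement vector received at scan $k = 2$, i.e.
$\mathbf{z} = [z_\beta\ z_D]' = h(\mathbf{x}) + \mathbf{w}$ with $\mathbf{w} \sim N(0, R)$.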
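A minimal sketch of the measurement function and of its Jacobian, as obtained by differentiating $\arctan(x/y)$ and $\sqrt{x^2 + y^2}$, is given below together with a finite-difference check. The use of `atan2` instead of a plain arctangent (to keep the azimuth valid in all quadrants) and the test state are assumptions of this sketch.

```python
import math


def h(state):
    """Measurement function: azimuth from North and distance, for state [x, x_dot, y, y_dot]."""
    x, _, y, _ = state
    return [math.atan2(x, y), math.hypot(x, y)]


def jacobian(state):
    """Analytic Jacobian of h with respect to [x, x_dot, y, y_dot]."""
    x, _, y, _ = state
    r2 = x * x + y * y
    r = math.sqrt(r2)
    return [
        [y / r2, 0.0, -x / r2, 0.0],   # d(beta)/d(state)
        [x / r, 0.0, y / r, 0.0],      # d(D)/d(state)
    ]


def numeric_jacobian(state, eps=1e-3):
    """Central finite differences, used only to cross-check the analytic form."""
    rows = [[0.0] * len(state) for _ in range(2)]
    for col in range(len(state)):
        plus, minus = list(state), list(state)
        plus[col] += eps
        minus[col] -= eps
        h_plus, h_minus = h(plus), h(minus)
        for row in range(2):
            rows[row][col] = (h_plus[row] - h_minus[row]) / (2.0 * eps)
    return rows


state = [3000.0, 100.0, 4000.0, -50.0]   # HYPOTHETICAL state in metres and m/s
H_analytic = jacobian(state)
H_numeric = numeric_jacobian(state)
print(H_analytic)
```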


14.3 The Attribute Contribution to GDA 


Data association, with its goal of partitioning observations into tracks, is a key function of any surveillance
system. An advanced tendency is the incorporation of generalized data (kinematics and attribute)
association to improve track maintenance performance in complicated situations, when kinematics data are
insufficient for a coherent decision-making process. Analogously with kinematic tracking, attribute
tracking can be considered as the process of combining information collected over time from one or more
sensors to refine the knowledge about the evolving attributes of the targets. The motivation for attribute
fusion comes from the necessity to ascertain the targets’ types, information which in consequence has
an important implication for enhancing the tracking performance. A number of techniques, probabilistic in
nature, are available for attribute fusion. Their analysis led us to believe that the theory of Dempster-
Shafer is well suited for representing uncertainty, but especially in the case of low conflicts between the
bodies of evidence. When the conflict increases and becomes high (a case which often occurs in the data
association process), the combinational rule of Dempster carries the risk of producing indefiniteness. To
avoid that significant risk we consider the form of the attribute likelihood function within the context of
DSm theory, i.e. the term to be used for computing the probabilities of validity for data association
hypotheses. There are a few basic steps realizing the concept of attribute data association.


14.3.1 The Input Fuzzification Interface 


The fuzzification interface (see fig. 14.1) transforms the numerical measurement received from a sensor into
a fuzzy set in accordance with the a priori defined fuzzy partition of the input space, the frame of
discernment $\Theta$. This frame includes all considered linguistic values related to the chosen particular
input variable and their corresponding membership functions. The fuzzification of numerical sensory data
needs dividing an optimal membership into a suitable number of fuzzy sets [14]. Such division provides smooth
transitions and overlaps among the associated fuzzy sets, according to the particular real world situation.


Figure 14.1: Fuzzification interface
The input variable considered in this particular case is the Radar Cross Section (RCS) of the observed
targets. In our work the modeled RCS data are analyzed to determine the target size, with the
subsequent declaration that the observed target is an aircraft of a specified type (Fighter, Cargo) or a
False Alarm. Bearing this in mind, we define two frames of discernment: the first one according to the size
of the RCS, $\Theta_1$ = {Very Small (VS), Small (S), Big (B)}, and the second one determining the
corresponding Target Type, $\Theta_2$ = {False Alarms (FA), Fighter (F), Cargo (C)}.


The radar cross section of the real targets is modeled as Swerling 3 type, where the density
function for the RCS $\sigma$ is given by:

$$f(\sigma) = \frac{4\sigma}{\sigma_{ave}^2} \exp\left[-\frac{2\sigma}{\sigma_{ave}}\right]$$

with the average RCS ($\sigma_{ave}$) varying between different targets’ types [16]. The cumulative
distribution function of the radar cross section is given by

$$F(\sigma_0) = P\{0 \leq \sigma \leq \sigma_0\} = 1 - \left(1 + \frac{2\sigma_0}{\sigma_{ave}}\right) \exp\left[-\frac{2\sigma_0}{\sigma_{ave}}\right]$$

Since the probabilities $F(\sigma_0)$ for having different values of radar cross section are uniformly
distributed in the interval [0,1] over time (i.e. these values are uncorrelated in time), a sample of
observation of the RCS can be simulated by solving the equation:

$$\left(1 + \frac{2\sigma_0}{\sigma_{ave}}\right) \exp\left[-\frac{2\sigma_0}{\sigma_{ave}}\right] = 1 - x$$

where $x$ is a random number that is uniformly distributed between 0 and 1.


The scenario considered in our work deals with the target types Fighter (F) and Military Cargo (C)
with an average RCS:

$$\sigma^F_{ave} = 1.2\ m^2 \quad \text{and} \quad \sigma^C_{ave} = 4\ m^2$$


The radar cross section of the False Alarms [1] is modeled as Swerling 2 type, where the
density function for the RCS is given by:

$$f(\sigma) = \frac{1}{\sigma_{ave}} \exp\left[-\frac{\sigma}{\sigma_{ave}}\right] \quad \text{with} \quad \sigma_{ave} = 0.3\ m^2$$

The cumulative distribution function is given by

$$F(\sigma_0) = P\{0 \leq \sigma \leq \sigma_0\} = 1 - \exp\left[-\frac{\sigma_0}{\sigma_{ave}}\right]$$

A sample of observation of the RCS can be computed by solving the equation:

$$\exp\left[-\frac{\sigma_0}{\sigma_{ave}}\right] = 1 - x$$

where $x$ is a random number that is uniformly distributed between 0 and 1.
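The two sampling equations can be solved as sketched below. The Swerling 2 equation inverts in closed form, while the Swerling 3 equation has no elementary inverse and is solved here by bisection, which is a choice of this sketch rather than a method stated in the text. The average RCS values are those given above; the function names and the random seed are assumptions.

```python
import math
import random


def swerling3_cdf(sigma, sigma_ave):
    """F(sigma) = 1 - (1 + 2 sigma / sigma_ave) exp(-2 sigma / sigma_ave)."""
    t = 2.0 * sigma / sigma_ave
    return 1.0 - (1.0 + t) * math.exp(-t)


def sample_swerling3(x, sigma_ave, tol=1e-12):
    """Solve F(sigma) = x for sigma by bisection, for 0 <= x < 1."""
    lo, hi = 0.0, sigma_ave
    while swerling3_cdf(hi, sigma_ave) < x:   # grow the bracket until it contains the root
        hi *= 2.0
    while hi - lo > tol * max(1.0, hi):
        mid = 0.5 * (lo + hi)
        if swerling3_cdf(mid, sigma_ave) < x:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def sample_swerling2(x, sigma_ave):
    """Closed-form solution of exp(-sigma / sigma_ave) = 1 - x."""
    return -sigma_ave * math.log(1.0 - x)


rng = random.Random(0)                       # fixed seed, for repeatability only
rcs_fighter = sample_swerling3(rng.random(), 1.2)
rcs_cargo = sample_swerling3(rng.random(), 4.0)
rcs_false_alarm = sample_swerling2(rng.random(), 0.3)
print(rcs_fighter, rcs_cargo, rcs_false_alarm)
```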
The input fuzzification interface maps the current modeled RCS values into three fuzzy sets: Very Small,
Small and Big, which define the corresponding linguistic values of the variable "RCS". Their
membership functions are not arbitrarily chosen, but rely on the respective histograms calculated for
10000 Monte Carlo runs. These fuzzy sets form the frame of discernment $\Theta_1$. After fuzzification
the new RCS value (rcs) is obtained in the form:

$$rcs \Rightarrow [\mu_{VerySmall}(rcs),\ \mu_{Small}(rcs),\ \mu_{Big}(rcs)]$$

In general, the grades $\mu_{VerySmall}(rcs)$, $\mu_{Small}(rcs)$, $\mu_{Big}(rcs)$ represent the possibilities
that the new RCS value belongs to the elements of the frame $\Theta_1$, and there is no requirement for them
to sum up to unity. Figure 14.2 below shows how the new observations for Cargo, Fighter and False Alarms are
modeled for 500 Monte Carlo runs, using the corresponding Swerling functions of type 3 and 2. It is evident
that they are strongly mixed, which impairs the distinction between them. That fact hides the possibility
of intrinsic conflicts between the fused bodies of evidence (general basic belief assignments (gbba) of
targets' tracks and observations), because of their imprecise belief functions, and consequently yields poor
target tracking performance. To deal successfully with this kind of stressful but real situation, we need
DSm theory to process these conflicts flexibly and adequately.


Figure 14.2: Simulation of RCS values over 500 Monte Carlo runs
14.3.2 Tracks’ Updating Procedures


14.3.2.1 Using Classical DSm Combinational Rule


After receiving the new observations detected during the current scan $k$, to guarantee that their
particular belief assignments $m(.)$ are general information granules, it is necessary
to transform each measurement’s set of fuzzy membership grades into the corresponding mass function
before it is fused. This is realized through normalization with respect to the unity:

$$m_{meas}(C) = \frac{\mu_C(rcs)}{\sum_{C \in \Theta_1} \mu_C(rcs)}, \quad \forall C \in \Theta_1 = \{VS, S, B\}$$

The general basic belief assignments (gbba) of the tracks' histories are described in terms of the hyper-power set:

$$D^{\Theta_1} = \{\emptyset, VS, S, B, VS \cap S \cap B, VS \cap S, VS \cap B, S \cap B, (VS \cup S) \cap B, (VS \cup B) \cap S, (S \cup B) \cap VS, (VS \cap S) \cup (VS \cap B) \cup (S \cap B), (VS \cap S) \cup B, (VS \cap B) \cup S, (S \cap B) \cup VS, VS \cup S, VS \cup B, S \cup B, VS \cup S \cup B\}$$


Then the classical DSm rule of combination (see chapter 1) is used for tracks' updating:

$$m^{ij}_{upd}(C) = [m^{i}_{hist} \oplus m^{j}_{meas}](C) = \sum_{A, B \in D^{\Theta_1}, A \cap B = C} m^{i}_{hist}(A) \, m^{j}_{meas}(B)$$

where $m^{ij}_{upd}(.)$ represents the gbba of the updated track i with the new observation j; $m^{i}_{hist}$ and $m^{j}_{meas}$ are respectively the gbba vectors of the history of track i and of the new observation j.
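A minimal sketch of the classical DSm rule follows. It assumes one particular encoding that is not taken from the chapter: each proposition of the hyper-power set is stored as the set of Venn-diagram regions it covers, so that intersection of propositions becomes plain set intersection. The names `atom` and `dsm_classic` and all masses are hypothetical.

```python
from itertools import product

THETA = ("VS", "S", "B")
REGIONS = range(1, 2 ** len(THETA))  # the 7 Venn regions of the free model

def atom(name):
    """Proposition theta_i, encoded as the set of Venn regions it covers."""
    bit = 1 << THETA.index(name)
    return frozenset(r for r in REGIONS if r & bit)

def dsm_classic(m_hist, m_meas):
    """m(C) = sum of m_hist(A) * m_meas(B) over all pairs with A n B = C."""
    fused = {}
    for (a, wa), (b, wb) in product(m_hist.items(), m_meas.items()):
        c = a & b  # non-empty for non-empty a, b in the free model
        fused[c] = fused.get(c, 0.0) + wa * wb
    return fused

VS, S, B = atom("VS"), atom("S"), atom("B")
m_hist = {VS: 0.5, S: 0.5}   # illustrative track history
m_meas = {VS: 0.6, B: 0.4}   # illustrative new observation
m_upd = dsm_classic(m_hist, m_meas)
print(m_upd[VS], m_upd[VS & S], m_upd[VS & B], m_upd[S & B])
```

No normalization is needed here: in the free model the mass of conflicting pairs is kept on the corresponding intersections such as VS∩S.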


It is important to note that the two independent sources of information considered here are the tracks' histories and the new observations, with their gbbas maintained in terms of the two hyper-power sets. In this way we are able to obtain and keep the decisions about the target types during all the scans.

Since DSmT uses a frame of discernment which is exhaustive but in general not exclusive, we are able to take into account and utilize the paradoxical information $VS \cap S \cap B$, $VS \cap S$, $VS \cap B$ and $S \cap B$. This information relates to the cases when the RCS value lies in an overlapping region, where it is hard to make a proper judgement about the tendency of its behavior. These non-empty sets and the masses assigned to them contribute to a better understanding of the overall tracking process.


14.3.2.2 Using Hybrid DSm Combinational Rule 


As mentioned above, the RCS data are used here to analyze and subsequently determine the type of the observed targets. For this reason a second frame of discernment $\Theta_2$ = {False Alarm (FA), Fighter (F), Cargo (C)} is maintained, in terms of which the decisions about target types have to be made. In doing so, we keep in mind the following correspondences:

• If rcs is Very Small then the "target" is False Alarm
• If rcs is Small then the target is Fighter
• If rcs is Big then the target is Cargo

We may transform the gbba of the updated tracks, formed in $D^{\Theta_1}$, into the respective gbba in $D^{\Theta_2}$, i.e.:

$$m^{ij}_{upd}(C_{C \in D^{\Theta_2}}) = m^{ij}_{upd}(C_{C \in D^{\Theta_1}})$$


But let us go deeper into the meaning of the propositions in the second hyper-power set. It should be:

$$D^{\Theta_2} = \{\emptyset, FA, F, C, FA \cap F \cap C, FA \cap F, FA \cap C, F \cap C, (FA \cup F) \cap C, (FA \cup C) \cap F, (F \cup C) \cap FA, (FA \cap F) \cup (FA \cap C) \cup (F \cap C), (FA \cap F) \cup C, (FA \cap C) \cup F, (F \cap C) \cup FA, FA \cup F, FA \cup C, F \cup C, FA \cup F \cup C\}$$

In real life, however, a target cannot be at one and the same time False Alarm and Fighter; False Alarm and Cargo; Fighter and Cargo; or False Alarm, Fighter and Cargo.


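This leads to the following hybrid DSm model $\mathcal{M}_1(\Theta_2)$, built by introducing the following exclusivity constraints (see the chapter presenting the hybrid DSm models and the hybrid DSm rule of combination in detail):

$$FA \cap F \overset{\mathcal{M}_1}{\equiv} \emptyset \qquad FA \cap C \overset{\mathcal{M}_1}{\equiv} \emptyset \qquad F \cap C \overset{\mathcal{M}_1}{\equiv} \emptyset \qquad FA \cap F \cap C \overset{\mathcal{M}_1}{\equiv} \emptyset$$

These exclusivity constraints directly imply the following ones:

$$(FA \cup F) \cap C \overset{\mathcal{M}_1}{\equiv} \emptyset \qquad (FA \cap F) \cup C \overset{\mathcal{M}_1}{\equiv} C$$

$$(FA \cup C) \cap F \overset{\mathcal{M}_1}{\equiv} \emptyset \qquad (FA \cap C) \cup F \overset{\mathcal{M}_1}{\equiv} F$$

$$(F \cup C) \cap FA \overset{\mathcal{M}_1}{\equiv} \emptyset \qquad (F \cap C) \cup FA \overset{\mathcal{M}_1}{\equiv} FA$$

and also the more general one

$$(FA \cap F) \cup (FA \cap C) \cup (F \cap C) \overset{\mathcal{M}_1}{\equiv} \emptyset$$

The model obtained in this way corresponds to Shafer's model, which can be considered as a particular case of the generalized free DSm model.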
Therefore, while the corresponding sets in $D^{\Theta_1}$ are usually non-empty, because of the exclusivity constraints in the second frame $\Theta_2$ the hyper-power set $D^{\Theta_2}$ is reduced to the classical power set:

$$D^{\Theta_2}_{\mathcal{M}_1} = \{\emptyset, FA, F, C, FA \cup F, FA \cup C, F \cup C, FA \cup F \cup C\}$$

So we have to update the previous fusion result, obtained via the classical DSm rule of combination, with this new information on the model $\mathcal{M}_1(\Theta_2)$ of the considered problem. This is solved with the hybrid DSm rule (see the corresponding chapter), which transfers the mass of these empty sets to the non-empty sets of $D^{\Theta_2}_{\mathcal{M}_1}$.
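The mass transfer can be illustrated with a short sketch of the two-source hybrid DSm rule, written here for Shafer's model on the frame {FA, F, C}. Several things are assumptions made for illustration and not details taken from this chapter: the encoding of propositions as sets of Venn regions, the helper names `atom`, `u` and `hybrid_dsm`, the order in which the three transfer cases are tried (intersection, then union, then union of the involved singletons, with total ignorance as the last resort), and the numerical masses.

```python
from itertools import product

THETA = ("FA", "F", "C")
REGIONS = range(1, 2 ** len(THETA))
# Shafer's model: only the regions belonging to exactly one class survive.
ALLOWED = frozenset(r for r in REGIONS if bin(r).count("1") == 1)

def atom(name):
    bit = 1 << THETA.index(name)
    return frozenset(r for r in REGIONS if r & bit)

def u(x):
    """Union of the singletons occurring in the canonical form of x."""
    minimal = [r for r in x if not any(s != r and (s & r) == s for s in x)]
    bits = 0
    for r in minimal:
        bits |= r
    return frozenset(q for q in REGIONS if q & bits)

def hybrid_dsm(m1, m2):
    fused = {}
    for (x1, w1), (x2, w2) in product(m1.items(), m2.items()):
        target = (x1 & x2) & ALLOWED            # conjunctive part
        if not target:
            target = (x1 | x2) & ALLOWED        # relatively empty intersection
        if not target:
            target = (u(x1) | u(x2)) & ALLOWED  # both operands empty
        if not target:
            target = ALLOWED                    # total ignorance
        fused[target] = fused.get(target, 0.0) + w1 * w2
    return fused

FA, F, C = atom("FA"), atom("F"), atom("C")
m = hybrid_dsm({FA: 0.6, F: 0.4}, {FA: 0.7, C: 0.3})
print(m[FA & ALLOWED], m[(FA | C) & ALLOWED])
```

In this example the product mass 0.6 x 0.3 of the conflicting pair (FA, C) ends up on FA∪C, i.e. on a non-empty set expressing uncertainty rather than on the now-empty intersection FA∩C.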


14.4 The Generalized Data Association Algorithm 


We consider a particular cluster and assume the existence of a set of n tracks at the current scan and a set of m received observations. A validated measurement is one which is either inside or on the boundary of the validation gate of a target. The inequality given in (14.1) is a validation test. It is used for filling the assignment matrix A:

$$A = [a_{ij}] = \begin{bmatrix} a_{11} & a_{12} & a_{13} & \cdots & a_{1m} \\ a_{21} & a_{22} & a_{23} & \cdots & a_{2m} \\ \vdots & \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & a_{n3} & \cdots & a_{nm} \end{bmatrix}$$

The elements of the assignment matrix A have the following values [13]:

$$a_{ij} = \begin{cases} \infty & \text{if } d_{ij}^2 > \gamma \\ 1 - P_k(i,j) P_a(i,j) & \text{if } d_{ij}^2 \le \gamma \end{cases}$$
The solution of the assignment matrix is the one that minimizes the sum of the chosen elements. We solve the assignment problem with the extension of the Munkres algorithm given in [10], which yields the optimal measurement-to-track association. Because of the considered crossing and/or closely spaced target scenarios, the joint probabilistic approach [7] is used to produce the probability terms $P_k$ and $P_a$. It provides a common basis for defining them and thus makes them compatible. The joint probabilistic data association (JPDA) approach restricts the problem size, because the number of generated hypotheses and the time needed to solve the assignment problem grow exponentially. That is why it is advisable to perform clustering before solving the data association problem. A cluster is a set of closely spaced objects. In our case, if two tracks have an observation in the overlapping parts of their gates, the tracks form a cluster, i.e. their clusters are merged. In this way the number of clusters is equal to or less than the number of tracks. The clustering is useful for at least two reasons:

1. The size of the assignment matrix, and hence the time needed to solve it, decreases;

2. The number of hypotheses for the JPDA-like approach used to define the kinematic and attribute probabilities also decreases.
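The minimum-sum solution of the assignment matrix can be illustrated on a tiny example. The sketch below is not the extended Munkres algorithm of [10]; it swaps in an exhaustive search, which gives the same optimum for very small matrices. The function name `best_assignment` and the matrix entries are hypothetical.

```python
import math
from itertools import permutations

INF = math.inf

def best_assignment(a):
    """Pick one distinct column per row so that the summed cost is minimal.

    Exhaustive stand-in for the Munkres algorithm; assumes rows <= columns.
    """
    n, m = len(a), len(a[0])
    best_cost, best_cols = INF, None
    for cols in permutations(range(m), n):
        cost = sum(a[i][j] for i, j in enumerate(cols))
        if cost < best_cost:
            best_cost, best_cols = cost, cols
    return best_cols, best_cost

# Hypothetical 2 x 3 matrix: entries are 1 - P_k * P_a, or INF outside the gate.
a = [[0.20, 0.70, INF],
     [0.60, 0.30, 0.90]]
cols, cost = best_assignment(a)
print(cols, cost)
```

The exhaustive search grows factorially with the matrix size, which is one more reason to keep clusters small.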
In the worst case, when all m measurements fall in the intersection of the validation regions of all n tracks, the number of hypotheses can be obtained as:

$$\sum_{i=0}^{\min(n,m)} C_m^i A_n^i$$

where

$$C_m^i = \frac{m!}{i!(m-i)!} \text{ for } 0 \le i \le m \quad \text{and} \quad A_n^i = \frac{n!}{(n-i)!} \text{ for } 0 \le i \le n$$

With these formulae the number of hypotheses for various values of m and n are computed and are shown in the following table. The enormous increase of the number of hypotheses can be seen.


13327 
394353 
58941091 


n = 10,m = 10 | 234662231 





Table 14.1: Worst case hypotheses number 
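The worst-case count can be checked numerically with a short sketch; the function name `worst_case_hypotheses` is illustrative.

```python
from math import comb, perm

def worst_case_hypotheses(n, m):
    """Sum over i of C(m, i) * A(n, i), for i = 0 .. min(n, m)."""
    return sum(comb(m, i) * perm(n, i) for i in range(min(n, m) + 1))

print(worst_case_hypotheses(10, 10))
```

For n = 10, m = 10 this gives 234662231, the last entry of the table. The sum is symmetric in n and m; the value 13327 is obtained for n = m = 6, and the values 394353 and 58941091 are obtained for the pairs (7, 8) and (9, 10) respectively.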


As a further improvement, only the first k-best hypotheses can be used, since the scores of the hypotheses decrease and a large number of low-score hypotheses has practically no influence on the result. Another original scheme for hypotheses generation has been considerably optimized in [9], which makes it a practical alternative to Murty's approach.


To define the probabilities for data association for different scenarios with a random number of false alarms, we implement the following steps on each scan:

1. Check gating: using the information for the received observations and for the currently tracked targets, check inequality (14.1) for each pair (track i, observation j). The result is an array indicating, for each observation, the tracks in whose gates it falls.
2. Clustering: define clusters of tracks and the observations falling in their gates.
3. For each cluster:

3.1 - Generate hypotheses following a Depth First Search (DFS) procedure with certain constraints [7]. In the JPDAF approach, the two constraints which have to be satisfied for a feasible event are:

(a) each observation can have only one origin (either a specific target or clutter), and
(b) no more than one observation originates from a target.

As a result of hypotheses generation, each hypothesis is defined by a set of numbers representing the observations assigned to the corresponding tracks, where zero represents the assignment of no observation to a given track.

3.2 - Compute hypothesis probabilities for kinematic and attribute contributions (detailed in the next paragraphs).

3.3 - Fill the assignment matrix, solve the assignment problem and define the observation-to-track association.


14.4.1 Kinematics probability term for generalized data association 


On the basis of the defined hypotheses, the kinematic probabilities are computed as:

$$P'(H_l) = \beta^{[N_M - (N_T - N_{nD})]} (1 - P_d)^{N_{nD}} P_d^{(N_T - N_{nD})} \prod_{i \neq 0, j \neq 0 | (i,j) \in H_l} g_{ij}$$

where $N_M$ is the number of observations in the cluster, $N_T$ the number of targets and $N_{nD}$ the number of not detected targets. The pairs $(i,j) \in H_l$ involved in the product represent all the observation-to-track associations involved in hypothesis $H_l$. The likelihood function $g_{ij}$ associated with the assignment of observation j to track i is:

$$g_{ij} = \frac{e^{-d_{ij}^2/2}}{(2\pi)^{M/2} \sqrt{|S_{ij}|}}$$

$P_d$ is the probability of detection and $\beta$ is the extraneous return density, which includes the probability densities for new tracks and false alarms:

$$\beta = \beta_{NT} + \beta_{FA}$$

The normalized probabilities are computed as:

$$P_k(H_l) = \frac{P'(H_l)}{\sum_{k=1}^{N_H} P'(H_k)}$$

where $N_H$ is the number of hypotheses. To compute the probability $P_k(i,j)$ that observation j should be assigned to track i, a sum is taken over the probabilities $P_k(.)$ from those hypotheses $H_l$ in which this assignment occurs.
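The computation of the kinematic term can be sketched for a small cluster as follows. The sketch enumerates the feasible hypotheses exhaustively instead of using a DFS procedure, and takes the likelihoods $g_{ij}$ as given inputs. The function names and all numerical values (likelihoods, detection probability, extraneous return density) are hypothetical.

```python
from itertools import product

def feasible_hypotheses(n_tracks, n_obs):
    """Yield tuples h where h[i] is the observation (1-based) assigned to
    track i, 0 meaning no observation; no observation is used twice."""
    for h in product(range(n_obs + 1), repeat=n_tracks):
        used = [j for j in h if j != 0]
        if len(used) == len(set(used)):
            yield h

def kinematic_probabilities(g, p_d, beta):
    """Normalized hypothesis probabilities; g[i][j] is the likelihood of
    observation j+1 for track i."""
    n_t, n_m = len(g), len(g[0])
    raw = {}
    for h in feasible_hypotheses(n_t, n_m):
        n_nd = h.count(0)  # number of not detected targets
        p = beta ** (n_m - (n_t - n_nd)) * (1 - p_d) ** n_nd * p_d ** (n_t - n_nd)
        for i, j in enumerate(h):
            if j:
                p *= g[i][j - 1]
        raw[h] = p
    total = sum(raw.values())
    return {h: p / total for h, p in raw.items()}

def association_probability(p_hyp, i, j):
    """Sum of the probabilities of the hypotheses assigning obs j to track i."""
    return sum(p for h, p in p_hyp.items() if h[i] == j)

g = [[0.8, 0.1],
     [0.2, 0.6]]  # illustrative likelihoods, 2 tracks x 2 observations
p_hyp = kinematic_probabilities(g, p_d=0.99, beta=1e-3)
print(len(p_hyp), association_probability(p_hyp, 0, 1))
```

For two tracks and two observations the enumeration produces 7 hypotheses, in agreement with the worst-case formula of the previous section.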


As a particular example, consider a cluster with two tracks and two new observations (see Fig. 14.3), detected during the moment of their closely spaced movement, where P1 and P2 are the tracks' predictions and O1, O2 are the received observations. Table 14.2 shows the particular hypotheses for the alternatives with respect to the targets' tracks and the associated probabilities.

Table 14.2: Target-oriented hypothesis based on kinematics.

Figure 14.3: Scenario with two tracks and two observations


14.4.2 Attribute probability terms for generalized data association 


The calculation of the attribute probability term follows the joint probabilistic approach:

$$P''(H_l) = \prod_{i \neq 0, j \neq 0 | (i,j) \in H_l} d_e(i,j)$$

where

$$d_e(i,j) = \sqrt{\sum_{C \in D^{\Theta_1}} \left[ m^{i}_{hist}(C) - m^{ij}_{CandHist}(C) \right]^2}$$

and $m^{ij}_{CandHist}(C)$ is a candidate history of the track, i.e. the result obtained after the fusion, via the classical DSm rule of combination, between the newly received attribute observation j and the predicted attribute state of track i (the confirmed track history from the previous scan).


In the case of two tracks and two new observations, considered in the previous section, and on the basis of the hypotheses matrix, one can obtain the probabilities of the hypotheses according to the following table:

Table 14.3: Target-oriented hypothesis based on attributes.

The corresponding normalized probabilities of association drawn from attribute information are obtained as:

$$P_e(H_l) = \frac{P''(H_l)}{\sum_{k=1}^{N_H} P''(H_k)}$$

where $N_H$ is the number of association hypotheses.

To compute the probability $P'_e(i,j)$ that observation j should be assigned to track i, a sum is taken over the probabilities $P_e(.)$ from those hypotheses $H_l$ in which this assignment occurs. Because the Euclidean distance is inversely proportional to the probability of association, the probability term $P_a(i,j) = 1 - P'_e(i,j)$ is used to match the corresponding kinematics probability.
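The attribute term can be sketched in the same spirit. The code below assumes the distances $d_e(i,j)$ have already been computed (a helper for the Euclidean distance between two gbbas is included), enumerates the feasible hypotheses exhaustively, and returns the terms $P_a(i,j)$. All names and numerical values are hypothetical.

```python
import math
from itertools import product

def attribute_distance(m_hist, m_cand):
    """Euclidean distance between two gbbas given as dictionaries."""
    keys = set(m_hist) | set(m_cand)
    return math.sqrt(sum((m_hist.get(c, 0.0) - m_cand.get(c, 0.0)) ** 2
                         for c in keys))

def attribute_terms(d_e):
    """Return P_a(i, j) = 1 - P'_e(i, j) for a matrix of distances d_e[i][j]."""
    n_t, n_m = len(d_e), len(d_e[0])
    raw = {}
    for h in product(range(n_m + 1), repeat=n_t):
        used = [j for j in h if j]
        if len(used) != len(set(used)):
            continue  # an observation cannot serve two tracks
        raw[h] = math.prod(d_e[i][j - 1] for i, j in enumerate(h) if j)
    total = sum(raw.values())
    p_e = {h: p / total for h, p in raw.items()}
    return [[1.0 - sum(p for h, p in p_e.items() if h[i] == j + 1)
             for j in range(n_m)] for i in range(n_t)]

d_e = [[0.1, 0.6],
       [0.7, 0.2]]  # illustrative distances: small means similar attributes
p_a = attribute_terms(d_e)
print(p_a)
```

With these values the pairs with the smaller distance, (track 1, observation 1) and (track 2, observation 2), receive the larger $P_a$, which is the behaviour needed to match the kinematic term.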


14.5 Simulation scenarios 


14.5.1 Simulation scenariol: Crossing targets 


The simulation scenario consists of two air targets (Fighter and Cargo) and a stationary sensor at the origin with $T_{scan}$ = 5 sec., measurement standard deviations 0.3 deg and 60 m for azimuth and range respectively. The targets move from West to East with a constant velocity of 250 m/sec. The headings of the fighter and cargo are 225 deg and 315 deg from North respectively. During scans 11 to 14 the targets perform maneuvers with 2.5g. Their trajectories are closely spaced in the vicinity of the two crossing points. The target detection probabilities have been set to 0.99 for both targets and the
extraneous return density 3 to 1078. In our scenario we consider the more complicated situations, when 
false alarms are present. The number of false alarms is Poisson distributed and their positions are uniformly distributed in the observation space.

Figure 14.4: Typical simulation for scenario 1 - Two Crossing Targets' Tracks


14.5.2 Simulation scenario 2: Closely spaced targets 


The second simulation scenario is influenced by the recent work of Bar-Shalom, Kirubarajan and Gokberk [6], which considers a case of closely spaced ground targets moving in parallel. Our case consists of four air targets (alternating Fighter, Cargo, Fighter, Cargo) moving with a constant velocity of 100 m/sec. The heading at the beginning is 155 deg from North. The targets make maneuvers with 0.85g (right, left, right turns). The sensor parameters and the false alarms are the same as in the first scenario.

Figure 14.5: Typical simulation for scenario 2 - Four Closely Spaced Air Targets' Tracks

14.6 Simulation results


This section presents the simulation results, based on 100 Monte Carlo runs. The goal is to demonstrate how the attribute measurements contribute to improving the track performance, especially in critical cases when the tracks are crossing and/or closely spaced.


14.6.1 Simulation results: Two crossing targets 


In the case when only kinematics data are available for data association (see fig. 14.6), it is evident that after scan 15 (the second crossing moment for the targets) the tracking algorithm loses the proper targets' direction.

Here the Tracks' Purity performance criterion is used to examine the ratio of the right associations. Track purity is the ratio of the number of correct observation-target associations (in case of a detected target) to the total number of available observations during the tracking scenario.

The results from table 14.4 show the proper (observation-track) associations in that case. Here "missed" is used for the case when there is no observation in the track's gate, and "FA" is used for the case when the track is associated with a false alarm.

Track 1 | 0.7313 | 0.2270 | 0.0304 | 0.0113
Track 2 | 0.2409 | 0.7035 | 0.0426 | 0.0130

Table 14.4: Tracks' Purity in case of Kinematics Only Data Association (KODA).





Table 14.5 shows the result when attribute data are utilized in the generalized data association algorithm in order to improve the tracks' maintenance performance. The hybrid DSm rule is applied to produce the attribute probability term in the generalized assignment matrix. As a result the tracks' purity increases.

Track 1 | 0.8252 | 0.1496 | 0.0165 | 0.0087
Track 2 | 0.1557 | 0.8243 | 0.0165 | 0.0035

Table 14.5: Tracks' Purity in case of Generalized Data Association based on DSmT.

Figure 14.6: Performance of Tracking Algorithm with Kinematics Only Data Association


14.6.2 Simulation results: Four closely spaced targets 


Figure 14.7 shows the performance of the implemented tracking algorithm with kinematics only data association. One can see that the four closely spaced targets moving in parallel lose their proper directions and the tracks switch.

Figure 14.7: Performance of Tracking Algorithm with Kinematics Only Data Association

The results in table 14.6 show the proper (observation-track) associations in that case. The corresponding results in case of GDA based on DSmT are described in table 14.7.


Table 14.6: Tracks' Purity in case of Kinematics Only Data Association.

Table 14.7: Tracks' Purity with GDA based on DSmT.


14.6.3 Simulation results of GDA based on Dempster-Shafer theory 


The results based on Dempster-Shafer theory for attribute data association are described in the tables below. For scenario 1 (two crossing targets), the tracks' purity is given in table 14.8. For scenario 2 (four closely spaced targets), the tracks' purity is given in table 14.9.

Track 1 | 0.7548 | 0.1609 | 0.0643 | 0.0200
Track 2 | 0.2209 | 0.7548 | 0.0174 | 0.0070

Table 14.8: Tracks' Purity with GDA based on Dempster-Shafer Theory (two crossing targets).

Table 14.9: Tracks' Purity with GDA based on Dempster-Shafer Theory (four closely spaced targets).


14.7 Comparative analysis of the results 


It is evident from the simulation results presented in the previous sections that, in general, the incorporated advanced concept of generalized data association improves the tracks' maintenance performance, especially in complicated situations (closely spaced and/or crossing targets in clutter). This is reflected in the obtained tracks' purity results. At the same time one can see that the tracks' purity obtained with Dezert-Smarandache theory is higher than the one obtained via Dempster-Shafer theory.

Analysing all the obstacles encountered in these simulations, it can be underlined that:


• Dezert-Smarandache theory makes it possible to analyze, process and utilize flexibly all the paradoxical information, a case peculiar to the problem of multiple target tracking in clutter, where the conflicts between the bodies of evidence (tracks' attribute histories and corresponding attribute measurements) often become high and critical. In this way it contributes to a better understanding of the overall tracking situation and to producing an adequate decision. By processing the paradoxes (propositions which are more specific than the others in the hyper-power set), the estimated entropy in the confirmed (via the right track-observation association) tracks' attribute histories decreases during the consecutive scans. This can be seen in figure 14.8, where the pignistic entropy (i.e. the Shannon entropy based on the pignistic probabilities derived from the resulting belief mass [15]) is estimated in the frame $\Theta_1$ = {Very Small (VS), Small (S), Big (B)} and the corresponding hyper-power set $D^{\Theta_1}$ (blue curve on the top subfigure). The simulation steps show that the source of evidence here is a hybrid one: paradoxical and uncertain. At the same time the entropy of the track's attribute history, described in the second frame $\Theta_2$ = {False Alarm (FA), Fighter (F), Cargo (C)} (red curve on the bottom subfigure), increases. This can be explained by the hybrid DSm model $\mathcal{M}_1(\Theta_2)$ applied here, built by introducing the exclusivity constraints imposed by real-life requirements (section 14.3.2.2). The model obtained in this way corresponds to Shafer's model, which is a particular case of the hybrid DSm model (the most constrained one). Therefore, while the corresponding sets in $D^{\Theta_1}$ are usually non-empty, because of the exclusivity constraints in the second frame $\Theta_2$ the hyper-power set is reduced to:

$$D^{\Theta_2}_{\mathcal{M}_1} = \{\emptyset, FA, F, C, FA \cup F, FA \cup C, F \cup C, FA \cup F \cup C\}$$

So in that frame the track's attribute history represents an uncertain source of information. Here the entropy increases with the uncertainty during the consecutive scans, because all the masses assigned to the empty sets in $D^{\Theta_2}_{\mathcal{M}_1}$ are transferred to the non-empty sets, in our case actually to the uncertainty.


Figure 14.8: Variation of Pignistic Entropy in Track's Attribute History in the two frames $\Theta_1$ and $\Theta_2$


• Because of the Swerling type modelling, the observations for False Alarms, Fighter and Cargo overlap strongly. This causes conflicts between the general basic belief assignments of the described bodies of evidence. When the conflict becomes unity, Dempster's rule of combination is undefined and consequently the fusion process cannot be carried out. On the other hand, if a received modeled measurement leads to a track's attribute update in which unity is assigned to some particular elementary hypothesis, then from that point on Dempster's rule of combination becomes indifferent to any other measurements in the next scans. This means that the track's attribute history remains the same, regardless of the received observations. It naturally leads to incoherent and inadequate decisions about the right observation-to-track associations.

14.8 Conclusions


This work presents an approach for target tracking which incorporates the advanced concept of generalized data (kinematics and attribute) association. The realized algorithm is based on a Global Nearest Neighbour-like approach and uses the Munkres algorithm to resolve the generalized association matrix. The principles of the Dezert-Smarandache theory of plausible and paradoxical reasoning are applied to utilize attribute data. In particular, the new general hybrid DSm rule of combination is used to deal with particular integrity constraints associated with some elements of the free distributive lattice. A comparison with the corresponding results obtained via Dempster-Shafer theory is made. The results show that Dempster-Shafer theory is well suited for representing uncertainty, but only in cases of low conflict between the bodies of evidence, while Dezert-Smarandache theory contributes to improving track maintenance performance in complicated situations (crossing and/or closely spaced targets), assuring flexible and coherent decision-making when kinematics data are insufficient to provide the proper decisions.
On Blackman's Data Association Problem

Abstract: Modern multitarget-multisensor tracking systems involve the development of reliable methods for the data association and the fusion of multiple sensor information, and more specifically the partitioning of observations into tracks. This chapter discusses and compares the application of Dempster-Shafer Theory (DST) and the Dezert-Smarandache Theory (DSmT) methods to the fusion of multiple sensor attributes for target identification purpose. We focus our attention on the paradoxical Blackman's association problem and propose several approaches to outperform Blackman's solution. We clarify some preconceived ideas about the use of degree of conflict between sources as potential criterion for partitioning evidences.


15.1 Introduction 


The association problem is of major importance in most modern multitarget-multisensor tracking systems. This task is particularly difficult when data are uncertain and are modeled by basic belief masses and when sources are conflicting. The solution adopted is usually based on the Dempster-Shafer Theory (DST) [9] because it provides an elegant theoretical way to combine uncertain information. However, Dempster's rule of combination can give rise to some paradoxes/anomalies and can fail to provide the correct solution for some specific association problems. This has already been pointed out by Samuel Blackman in [2]. Therefore more study in this area is required and we propose here a new analysis of Blackman's association problem (BAP). We present in the sequel the original BAP and recall the classical attempts to solve it based on DST (including Blackman's method). In the second part of the chapter we propose and compare new approaches based on the DSmT with the free DSm model. The last part of the chapter provides a comparison of the performances of all the proposed approaches from Monte-Carlo simulation results.

This chapter is based on a paper [4] presented during the International Conference on Information Fusion, Fusion 2003, Cairns, Australia, in July 2003 and is reproduced here with permission of the International Society of Information Fusion.


15.2 Blackman’s Data Association Problem 


15.2.1 Association Problem no. 1 


Let us now recall the original Blackman's association problem [2]. Consider only two target attribute types corresponding to the very simple frame of discernment $\Theta = \{\theta_1, \theta_2\}$ and the association/assignment problem for a single attribute observation Z and two tracks ($T_1$ and $T_2$). Assume now the following two predicted basic belief assignments (bba) for the attributes of the two tracks:

$$m_{T_1}(\theta_1) = 0.5 \qquad m_{T_1}(\theta_2) = 0.5 \qquad m_{T_1}(\theta_1 \cup \theta_2) = 0$$

$$m_{T_2}(\theta_1) = 0.1 \qquad m_{T_2}(\theta_2) = 0.1 \qquad m_{T_2}(\theta_1 \cup \theta_2) = 0.8$$

We now assume that the following new bba, drawn from the attribute observation Z of the system, is received:

$$m_Z(\theta_1) = 0.5 \qquad m_Z(\theta_2) = 0.5 \qquad m_Z(\theta_1 \cup \theta_2) = 0$$

The problem is to develop a general method to find the correct assignment of the attribute measure $m_Z(.)$ with the predicted one $m_{T_i}(.)$, i = 1, 2. Since $m_Z(.)$ matches perfectly with $m_{T_1}(.)$ whereas $m_Z(.)$ does not match with $m_{T_2}(.)$, the optimal solution is obviously given by the assignment $(m_Z(.) \leftrightarrow m_{T_1}(.))$. The problem is to find a unique, general and reliable method for solving this specific problem and for solving all the other possible association problems as well.


15.2.2 Association Problem no. 2 


To compare several potential issues, we propose to modify the previous problem into a second one by keeping the same predicted bba $m_{T_1}(.)$ and $m_{T_2}(.)$ but by considering now the following bba $m_Z(.)$:

$$m_Z(\theta_1) = 0.1 \qquad m_Z(\theta_2) = 0.1 \qquad m_Z(\theta_1 \cup \theta_2) = 0.8$$

Since $m_Z(.)$ matches perfectly with $m_{T_2}(.)$, the correct solution is now directly given by $(m_Z(.) \leftrightarrow m_{T_2}(.))$.

The sequel of this chapter is devoted to the presentation of some attempts for solving the BAP, not only for these two specific problems 1 and 2, but for the more general problem where the bba $m_Z(.)$ does not match perfectly with one of the predicted bba $m_{T_i}$, i = 1 or i = 2, due to observation noises.


15.3 Attempts for solutions 


We examine now several approaches which have already been (or could be) envisaged to solve the general 


association problem. 


15.3.1 The simplest approach 


The simplest idea for solving the BAP, surprisingly not reported by Blackman in [2], is to use a classical minimum distance criterion directly between the predictions $m_{T_i}$ and the observation $m_Z$. The classical $L^1$ (city-block) or $L^2$ (Euclidean) distances are typically used. Such a simple criterion obviously provides the correct association in most cases involving perfect (noise-free) observations $m_Z(.)$. But there exist numerical cases for which the optimal decision cannot be found at all, as in the following numerical example:

$$m_{T_1}(\theta_1) = 0.4 \qquad m_{T_1}(\theta_2) = 0.4 \qquad m_{T_1}(\theta_1 \cup \theta_2) = 0.2$$

$$m_{T_2}(\theta_1) = 0.2 \qquad m_{T_2}(\theta_2) = 0.2 \qquad m_{T_2}(\theta_1 \cup \theta_2) = 0.6$$

$$m_Z(\theta_1) = 0.3 \qquad m_Z(\theta_2) = 0.3 \qquad m_Z(\theta_1 \cup \theta_2) = 0.4$$

From these bba, one gets $d_{L^1}(T_1, Z) = d_{L^1}(T_2, Z) = 0.4$ (or $d_{L^2}(T_1, Z) = d_{L^2}(T_2, Z) \approx 0.24$) and no decision can be drawn for sure, although the minimum conflict approach (detailed in the next section) gives instead the solution $(Z \leftrightarrow T_2)$. It is not obvious in such cases how to justify this method with respect to the others. What is more important in practice [2] is not only the association solution itself but also the attribute likelihood function $P(Z|T_i) \equiv P(Z \leftrightarrow T_i)$. Many likelihood functions (exponential, hyper-exponential, Chi-square, Weibull pdf, etc.) could be built from the $d_{L^1}(T_i, Z)$ (or $d_{L^2}(T_i, Z)$) measures, but we do not know in general which one corresponds to the real attribute likelihood function.
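The tie in the numerical example above can be reproduced with a few lines. The dictionary keys `t1`, `t2`, `t12` (for $\theta_1$, $\theta_2$, $\theta_1 \cup \theta_2$) and the function names are illustrative choices.

```python
import math

def dist_l1(a, b):
    """City-block distance between two bbas defined on the same focal sets."""
    return sum(abs(a[k] - b[k]) for k in a)

def dist_l2(a, b):
    """Euclidean distance between two bbas defined on the same focal sets."""
    return math.sqrt(sum((a[k] - b[k]) ** 2 for k in a))

T1 = {"t1": 0.4, "t2": 0.4, "t12": 0.2}
T2 = {"t1": 0.2, "t2": 0.2, "t12": 0.6}
Z = {"t1": 0.3, "t2": 0.3, "t12": 0.4}

print(dist_l1(T1, Z), dist_l1(T2, Z))
print(dist_l2(T1, Z), dist_l2(T2, Z))
```

Both tracks are at the same distance from Z for either norm, so the minimum distance criterion cannot discriminate between them in this case.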


15.3.2 The minimum conflict approach 


The first idea suggested by Blackman for solving the association problem was to apply Dempster's rule of combination [9] $m_{T_iZ}(.) = [m_{T_i} \oplus m_Z](.)$, defined by $m_{T_iZ}(\emptyset) = 0$ and, for any $C \neq \emptyset$ and $C \subseteq \Theta$,

$$m_{T_iZ}(C) = \frac{1}{1 - k_{T_iZ}} \sum_{A \cap B = C} m_{T_i}(A) m_Z(B)$$

and to choose the solution corresponding to the minimum of conflict $k_{T_iZ}$. The sum in the previous formula is over all $A, B \subseteq \Theta$ such that $A \cap B = C$. The degree of conflict $k_{T_iZ}$ between $m_{T_i}$ and $m_Z$ is given by $\sum_{A \cap B = \emptyset} m_{T_i}(A) m_Z(B)$, and the rule is defined only when $k_{T_iZ} \neq 1$. Thus, an intuitive choice for the attribute likelihood function is $P(Z|T_i) = 1 - k_{T_iZ}$. If we now apply Dempster's rule to problem 1, we get the same result for both assignments, i.e. $m_{T_1Z}(.) = m_{T_2Z}(.)$ with $m_{T_iZ}(\theta_1) = m_{T_iZ}(\theta_2) = 0.5$ for i = 1, 2 and $m_{T_iZ}(\theta_1 \cup \theta_2) = 0$. More surprisingly, the correct assignment $(Z \leftrightarrow T_1)$ is not given by the minimum of conflict between sources, since one has actually $(k_{T_1Z} = 0.5) > (k_{T_2Z} = 0.1)$. Thus, it is impossible to get the correct solution for this first BAP from the minimum conflict criterion as we first expected intuitively. This same criterion provides however the correct solution for problem 2, since one has now $(k_{T_2Z} = 0.02) < (k_{T_1Z} = 0.1)$. The combined bba for problem 2 are given by $m_{T_1Z}(\theta_1) = m_{T_1Z}(\theta_2) = 0.5$ and $m_{T_2Z}(\theta_1) = m_{T_2Z}(\theta_2) = 0.17347$, $m_{T_2Z}(\theta_1 \cup \theta_2) = 0.65306$.
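The conflicts and combined masses quoted above can be checked with a small sketch of Dempster's rule restricted to a two-element frame. Representing a bba as the tuple $(m(\theta_1), m(\theta_2), m(\theta_1 \cup \theta_2))$ and the name `dempster` are illustrative choices.

```python
def dempster(a, b):
    """Dempster's rule on {theta1, theta2}.

    Each bba is a tuple (m(theta1), m(theta2), m(theta1 U theta2)).
    Returns the combined bba and the degree of conflict k.
    """
    a1, a2, a12 = a
    b1, b2, b12 = b
    k = a1 * b2 + a2 * b1  # mass of the pairs with empty intersection
    if 1.0 - k <= 1e-12:
        raise ValueError("total conflict: Dempster's rule is undefined")
    norm = 1.0 - k
    combined = ((a1 * b1 + a1 * b12 + a12 * b1) / norm,
                (a2 * b2 + a2 * b12 + a12 * b2) / norm,
                (a12 * b12) / norm)
    return combined, k

T1 = (0.5, 0.5, 0.0)
T2 = (0.1, 0.1, 0.8)
Z_problem1 = (0.5, 0.5, 0.0)
Z_problem2 = (0.1, 0.1, 0.8)

for label, Z in (("problem 1", Z_problem1), ("problem 2", Z_problem2)):
    (m1, k1), (m2, k2) = dempster(T1, Z), dempster(T2, Z)
    print(label, "k_T1Z =", round(k1, 5), "k_T2Z =", round(k2, 5))
    print("  m_T1Z =", tuple(round(x, 5) for x in m1))
    print("  m_T2Z =", tuple(round(x, 5) for x in m2))
```

In problem 1 the smaller conflict belongs to the wrong track $T_2$, while in problem 2 it belongs to the right one, which is the inconsistency discussed in the text.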


15.3.3 Blackman's approach 


To solve this apparent anomaly, Samuel Blackman then proposed in [2] to use a relative, rather than an absolute, attribute likelihood function as follows:

$$L(Z|T_i) = (1 - k_{T_iZ}) / (1 - k_{min})$$

where $k_{min}$ is the minimum conflict factor that could occur for either the observation Z or the track $T_i$ in the case of perfect assignment (when $m_Z(.)$ and $m_{T_i}(.)$ coincide). By adopting this relative likelihood function, one gets now for problem 1

$$L(Z|T_1) = \frac{1 - 0.5}{1 - 0.5} = 1$$

$$L(Z|T_2) = \frac{1 - 0.1}{1 - 0.02} = 0.92$$

Using this second Blackman's approach, there is now a larger likelihood associated with the first assignment (hence the right assignment solution for problem 1 can now be obtained based on the maximum likelihood criterion), but the difference between the two likelihood values is very small. As reported by S. Blackman in [2], more study in this area is required, and we now examine some other approaches. It is also interesting to note that this same approach fails to solve problem 2, since the corresponding likelihood functions for problem 2 become now








which means that the maximum likelihood solution gives now the incorrect assignment $(m_Z(.) \leftrightarrow m_{T_1}(.))$ for problem 2 as well.
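The relative likelihood can be explored numerically with the sketch below. It rests on one assumption that is not stated explicitly in the text: $k_{min}$ is taken as the self-conflict of the track bba, i.e. the conflict obtained when $T_i$ is combined with an observation identical to itself. This choice reproduces the values 1 and 0.92 quoted for problem 1. The function names are illustrative.

```python
def conflict(a, b):
    """Degree of conflict between two bbas (m(t1), m(t2), m(t1 U t2))."""
    return a[0] * b[1] + a[1] * b[0]

def relative_likelihood(track, z):
    # ASSUMPTION: k_min is the self-conflict of the track bba.
    k_min = conflict(track, track)
    return (1.0 - conflict(track, z)) / (1.0 - k_min)

T1 = (0.5, 0.5, 0.0)
T2 = (0.1, 0.1, 0.8)

for label, Z in (("problem 1", (0.5, 0.5, 0.0)), ("problem 2", (0.1, 0.1, 0.8))):
    print(label, round(relative_likelihood(T1, Z), 2),
          round(relative_likelihood(T2, Z), 2))
```

Under this assumption the likelihood attached to $T_1$ is the larger one in both problems, so the maximum likelihood decision is right for problem 1 and wrong for problem 2, in line with the failure described above.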


15.3.4 Tchamova's approach 


Following the idea of section [5.3.1] Albena Tchamova has recently proposed in [3] to use rather the L+ 
(city-block) distance dı (T;, T;Z) or L? (Euclidean) distance d2(T;, T;Z) between the predicted bba mz, (.) 
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and the updated/combined bba m7, z(.) to measure the closeness of assignments with 
d(T,,T,2)= Y, | mr,(A) -mr z(A)| 
AE29 
1/2 


dre (Ta T:Z) =| > [mr (4) — mnz(A)]?] 
AE2° 


The decision criterion here is again to choose the solution which yields the minimum distance. This
idea is justified by analogy with the steady-state Kalman filter (KF) behavior: if $z(k+1)$
and $\hat{z}(k+1|k)$ correspond to the measurement and the predicted measurement for time $k+1$, then the
well-known KF state update equation [1] is given by (assuming here that the dynamic matrix is the identity)
$\hat{x}(k+1|k+1) = \hat{x}(k+1|k) + K(z(k+1) - \hat{z}(k+1|k))$. The steady state is reached when $z(k+1)$
coincides with the predicted measurement $\hat{z}(k+1|k)$ and therefore when $\hat{x}(k+1|k+1) = \hat{x}(k+1|k)$. In
our context, $m_{T_i}(.)$ plays the role of the predicted state and $m_{T_iZ}(.)$ the role of the updated state. Therefore it
a priori makes sense that the correct assignment should be obtained when $m_{T_iZ}(.)$ tends towards $m_{T_i}(.)$ for
some closeness/distance criterion. Monte Carlo simulation results will show, however, that this approach
is also not as good as one might expect.


It is interesting to note that Tchamova's approach succeeds in providing the correct solution for problem
1 with both distance criteria since $(d_{L^1}(T_1, T_1Z) = 0) < (d_{L^1}(T_2, T_2Z) \approx 1.60)$ and
$(d_{L^2}(T_1, T_1Z) = 0) < (d_{L^2}(T_2, T_2Z) \approx 0.98)$, but provides the wrong solution for problem 2 since we get both
$(d_{L^1}(T_2, T_2Z) \approx 0.29) > (d_{L^1}(T_1, T_1Z) = 0)$ and $(d_{L^2}(T_2, T_2Z) \approx 0.18) > (d_{L^2}(T_1, T_1Z) = 0)$.
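The distance values quoted above can be checked numerically. The following Python sketch is an illustration only. It assumes that the bba of the two association problems, written over $(\theta_1, \theta_2, \theta_1 \cup \theta_2)$, are $m_{T_1} = (0.5, 0.5, 0)$ and $m_{T_2} = (0.1, 0.1, 0.8)$, with $m_Z = m_{T_1}$ in problem 1 and $m_Z = m_{T_2}$ in problem 2 (these values are not restated in this section). All function names are hypothetical.

```python
from itertools import product
from math import sqrt

# Focal elements of the frame {theta1, theta2}: theta1, theta2, theta1 U theta2.
A1, A2, A3 = frozenset({1}), frozenset({2}), frozenset({1, 2})

def dempster(m1, m2):
    """Dempster's rule: conjunctive combination followed by normalization."""
    combined = {A1: 0.0, A2: 0.0, A3: 0.0}
    conflict = 0.0
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        inter = a & b
        if inter:
            combined[inter] += x * y
        else:
            conflict += x * y  # mass committed to the empty set
    return {k: v / (1.0 - conflict) for k, v in combined.items()}

def d_l1(m_pred, m_upd):
    """City-block distance between predicted and updated bba."""
    return sum(abs(m_pred[a] - m_upd[a]) for a in m_pred)

def d_l2(m_pred, m_upd):
    """Euclidean distance between predicted and updated bba."""
    return sqrt(sum((m_pred[a] - m_upd[a]) ** 2 for a in m_pred))

# Assumed bba values (see the lead-in above).
m_T1 = {A1: 0.5, A2: 0.5, A3: 0.0}
m_T2 = {A1: 0.1, A2: 0.1, A3: 0.8}

def distances(m_Z):
    """Return (d_L1, d_L2) between m_Ti and m_TiZ for i = 1, 2."""
    out = {}
    for name, m_T in (("T1", m_T1), ("T2", m_T2)):
        m_TZ = dempster(m_T, m_Z)
        out[name] = (d_l1(m_T, m_TZ), d_l2(m_T, m_TZ))
    return out

problem1 = distances(m_T1)  # observation coincides with m_T1
problem2 = distances(m_T2)  # observation coincides with m_T2
print(problem1)
print(problem2)
```

Under these assumptions the minimum distance selects $T_1$ in both problems, which is the right answer for problem 1 and the wrong one for problem 2.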


15.3.5 The entropy approaches 


We examine here the results drawn from several approaches based on entropy-like measures. Our idea is now to use
as decision criterion the minimum of the following entropy-like measures (expressed in nats, i.e. with the natural
logarithm, and with the convention $0 \log(0) = 0$):

• Extended entropy-like measure:

$$H_{ext}(m) \triangleq -\sum_{A \in 2^\Theta} m(A) \log(m(A))$$

• Generalized entropy-like measure [5] [8]:

$$H_{gen}(m) \triangleq -\sum_{A \in 2^\Theta} m(A) \log(m(A)/|A|)$$

• Pignistic entropy:

$$H_{betP}(m) \triangleq -\sum_{\theta_i \in \Theta} P\{\theta_i\} \log(P\{\theta_i\})$$

where the pignistic (betting) probabilities $P\{\theta_i\}$ are obtained by

$$\forall \theta_i \in \Theta, \quad P\{\theta_i\} = \sum_{B \subseteq \Theta \,|\, \theta_i \in B} \frac{1}{|B|} m(B)$$

It can be easily verified that the minimum entropy criteria (based on $H_{ext}$, $H_{gen}$ or $H_{betP}$) computed
from the combined bba $m_{T_1Z}(.)$ or $m_{T_2Z}(.)$ are actually unable to provide the correct solution for problem
1 because of the indiscernibility of $m_{T_1Z}(.)$ with respect to $m_{T_2Z}(.)$. For problem 1, we get $H_{ext}(m_{T_1Z}) =
H_{ext}(m_{T_2Z}) = 0.69315$ and exactly the same numerical results for $H_{gen}$ and $H_{betP}$ because no uncertainty is
involved in the updated bba for this particular case. If we now examine the numerical results obtained
for problem 2, we can see that the minimum entropy criteria are also unable to provide the correct solution
based on the $H_{ext}$, $H_{gen}$ or $H_{betP}$ criteria since one has $H_{ext}(m_{T_2Z}) = 0.88601 > H_{ext}(m_{T_1Z}) = 0.69315$,
$H_{gen}(m_{T_2Z}) = 1.3387 > H_{gen}(m_{T_1Z}) = 0.69315$ and $H_{betP}(m_{T_1Z}) = H_{betP}(m_{T_2Z}) = 0.69315$.
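The entropy values reported for problem 2 can be reproduced with a few lines of Python. This is only a sketch under stated assumptions: the bba values $m_{T_1} = (0.5, 0.5, 0)$ and $m_Z = m_{T_2} = (0.1, 0.1, 0.8)$ over $(\theta_1, \theta_2, \theta_1 \cup \theta_2)$ are assumed from the problem statement (not restated in this section), and the function names are hypothetical.

```python
from itertools import product
from math import log

A1, A2, A3 = frozenset({1}), frozenset({2}), frozenset({1, 2})

def dempster(m1, m2):
    """Dempster's rule of combination (normalized conjunctive rule)."""
    combined = {A1: 0.0, A2: 0.0, A3: 0.0}
    conflict = 0.0
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        if a & b:
            combined[a & b] += x * y
        else:
            conflict += x * y
    return {k: v / (1.0 - conflict) for k, v in combined.items()}

def h_ext(m):
    """Extended entropy-like measure, in nats, with 0 log 0 = 0."""
    return -sum(v * log(v) for v in m.values() if v > 0)

def h_gen(m):
    """Generalized entropy-like measure: masses are divided by |A|."""
    return -sum(v * log(v / len(a)) for a, v in m.items() if v > 0)

def pignistic(m):
    """Pignistic probabilities: each mass m(B) is shared equally among B."""
    return {t: sum(v / len(b) for b, v in m.items() if t in b) for t in (1, 2)}

def h_betp(m):
    """Shannon entropy of the pignistic probabilities."""
    return -sum(p * log(p) for p in pignistic(m).values() if p > 0)

# Assumed bba values for problem 2 (observation coincides with m_T2).
m_T1 = {A1: 0.5, A2: 0.5, A3: 0.0}
m_T2 = {A1: 0.1, A2: 0.1, A3: 0.8}
m_Z = m_T2

m_T1Z = dempster(m_T1, m_Z)
m_T2Z = dempster(m_T2, m_Z)
for name, h in (("H_ext", h_ext), ("H_gen", h_gen), ("H_betP", h_betp)):
    print(name, round(h(m_T1Z), 5), round(h(m_T2Z), 5))
```

With these inputs the entropy of $m_{T_1Z}$ is never larger than that of $m_{T_2Z}$, so a minimum entropy rule cannot single out the correct assignment $T_2$.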


These first results indicate that approaches based on absolute entropy-like measures appear to be
useless for solving BAP, since nothing justifies that the correct assignment
corresponds to the absolute minimum entropy-like measure merely because $m_Z$ can stem from the least
informational source. The association solution itself is actually independent of the informational content
of each source.


Another attempt is to use rather the minimum of the variation of entropy as decision criterion. Thus,
the following $\min\{\Delta_1(.), \Delta_2(.)\}$ criteria are examined, where the variations $\Delta_i(.)$ for $i = 1, 2$ are defined as
the

• variation of extended entropy:

$$\Delta_i(H_{ext}) \triangleq H_{ext}(m_{T_iZ}) - H_{ext}(m_{T_i})$$

• variation of generalized entropy:

$$\Delta_i(H_{gen}) \triangleq H_{gen}(m_{T_iZ}) - H_{gen}(m_{T_i})$$

• variation of pignistic entropy:

$$\Delta_i(H_{betP}) \triangleq H_{betP}(m_{T_iZ}) - H_{betP}(m_{T_i})$$

Only the 2nd criterion, i.e. $\min(\Delta_i(H_{gen}))$, actually provides the correct solution for problem 1, and
none of these criteria gives the correct solution for problem 2.


The last idea is then to use the minimum of relative variations of pignistic probabilities of $\theta_1$ and $\theta_2$
given by the minimum on $i$ of

$$\Delta_i(P) \triangleq \sum_{j=1}^{2} \frac{|P_{T_iZ}(\theta_j) - P_{T_i}(\theta_j)|}{P_{T_i}(\theta_j)}$$

where $P_{T_iZ}(.)$ and $P_{T_i}(.)$ are respectively the pignistic transformations of $m_{T_iZ}(.)$ and $m_{T_i}(.)$.
Unfortunately, this criterion is unable to provide the solution for problems 1 and 2 because one has here in both
problems $\Delta_1(P) = \Delta_2(P) = 0$.


15.3.6 Schubert’s approach 


We now examine the possibility of using a Dempster-Shafer clustering method based on the metaconflict
function (MC-DSC), proposed in Johan Schubert's research works [6] [8], for solving the association
problems 1 and 2. A DSC method is a method of clustering uncertain data using the conflict in Dempster's
rule as a distance measure [7]. The basic idea is to separate/partition evidences by their conflict rather
than by their proposition's event parts. Due to space limitation, we will just summarize here the principle
of the classical MC-DSC method.


Assume a given set of evidences (bba) $E(k) \triangleq \{m_{T_i}(.), i = 1, \ldots, n\}$ is available at a given index
(space or time or whatever) $k$ and suppose that a given set $E(k+1) \triangleq \{m_{Z_j}(.), j = 1, \ldots, m\}$ of new
bba is then available for index $k+1$. The complete set of evidences representing all available information
at index $k+1$ is $\chi = E(k) \cup E(k+1) \triangleq \{e_1, \ldots, e_q\} = \{m_{T_i}(.), i = 1, \ldots, n, m_{Z_j}(.), j = 1, \ldots, m\}$ with
$q = n + m$. The problem we now face is to find the optimal partition/assignment of $\chi$ into disjoint
subsets $\chi_p$ in order to combine the information within each $\chi_p$ in a coherent and efficient way. The idea is
to combine, in a first step, the set of bba belonging to the same subset $\chi_p$ into a new bba $m_p(.)$ having
a corresponding conflict factor $k_p$. The conflict factors $k_p$ are then used, in a second step, at a metalevel
of evidence associated with the new frame of discernment $\Theta = \{AdP, \neg AdP\}$ where $AdP$ is short for
adequate partition. From each subset $\chi_p$, $p = 1, \ldots, P$ of the partition under investigation, a new bba is
defined as:

$$m_{\chi_p}(\neg AdP) \triangleq k_p \quad \text{and} \quad m_{\chi_p}(\Theta) \triangleq 1 - k_p$$

The combination of all these metalevel bba $m_{\chi_p}(.)$ by Dempster's rule yields a global bba

$$m(.) = m_{\chi_1}(.) \oplus \ldots \oplus m_{\chi_P}(.)$$

with a corresponding metaconflict factor denoted $Mcf(\chi_1, \ldots, \chi_P) = k_{1,\ldots,P}$. It can be shown [6] that the
metaconflict factor can be easily calculated directly from the conflict factors $k_p$ by the following metaconflict
function (MCF)

$$Mcf(\chi_1, \ldots, \chi_P) = 1 - \prod_{p=1}^{P} (1 - k_p) \qquad (15.1)$$


By minimizing the metaconflict function (i.e. by browsing all potential assignments), we intuitively
expect to find the optimal/correct partition which will hopefully solve our association problem. Let us now go
back to our very simple association problems 1 and 2 and examine the results obtained from the
MC-DSC method.

The information available in the association problems is denoted $\chi = \{m_{T_1}(.), m_{T_2}(.), m_Z(.)\}$. We now
examine all possible partitions of $\chi$ and the corresponding metaconflict factors and decision (based on the
minimum metaconflict function criterion) as follows:


• Analysis for problem 1:

  - the (correct) partition $\chi_1 = \{m_{T_1}(.), m_Z(.)\}$ and $\chi_2 = \{m_{T_2}(.)\}$ yields through Dempster's rule
    the conflict factors $k_1 \triangleq k_{T_1Z} = 0.5$ for subset $\chi_1$ and $k_2 = 0$ for subset $\chi_2$ since there is
    no combination at all (and therefore no conflict) in $\chi_2$. According to (15.1), the value of the
    metaconflict is equal to

    $$Mcf_1 = 1 - (1 - k_1)(1 - k_2) = 0.5 = k_1$$

  - the (wrong) partition $\chi_1 = \{m_{T_1}(.)\}$ and $\chi_2 = \{m_{T_2}(.), m_Z(.)\}$ yields the conflict factors
    $k_1 = 0$ for subset $\chi_1$ and $k_2 = 0.1$ for subset $\chi_2$. The value of the metaconflict is now equal to

    $$Mcf_2 = 1 - (1 - k_1)(1 - k_2) = 0.1 = k_2$$

  - since $Mcf_1 > Mcf_2$, the minimum of the metaconflict function provides the wrong assignment
    and the MC-DSC approach fails to generate the solution for problem 1.

• Analysis for problem 2:

  - the (wrong) partition $\chi_1 = \{m_{T_1}(.), m_Z(.)\}$ and $\chi_2 = \{m_{T_2}(.)\}$ yields through Dempster's rule
    the conflict factors $k_1 \triangleq k_{T_1Z} = 0.1$ for subset $\chi_1$ and $k_2 = 0$ for subset $\chi_2$ since there is
    no combination at all (and therefore no conflict) in $\chi_2$. According to (15.1), the value of the
    metaconflict is equal to

    $$Mcf_1 = 1 - (1 - k_1)(1 - k_2) = 0.1 = k_1$$

  - the (correct) partition $\chi_1 = \{m_{T_1}(.)\}$ and $\chi_2 = \{m_{T_2}(.), m_Z(.)\}$ yields the conflict factors
    $k_1 = 0$ for subset $\chi_1$ and $k_2 = 0.02$ for subset $\chi_2$. The value of the metaconflict is now equal
    to

    $$Mcf_2 = 1 - (1 - k_1)(1 - k_2) = 0.02 = k_2$$

  - since $Mcf_2 < Mcf_1$, the minimum of the metaconflict function provides in this case the correct
    solution for problem 2.
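The two analyses above can be replayed with a short Python sketch of the metaconflict function (15.1). It is an illustration only: the bba values $m_{T_1} = (0.5, 0.5, 0)$ and $m_{T_2} = (0.1, 0.1, 0.8)$ over $(\theta_1, \theta_2, \theta_1 \cup \theta_2)$, with $m_Z = m_{T_1}$ in problem 1 and $m_Z = m_{T_2}$ in problem 2, are assumed from the problem statement, and the function and key names are hypothetical.

```python
from itertools import product
from math import prod

A1, A2, A3 = frozenset({1}), frozenset({2}), frozenset({1, 2})

def conflict(m1, m2):
    """Conflict factor k of Dempster's rule: mass on empty intersections."""
    return sum(x * y
               for (a, x), (b, y) in product(m1.items(), m2.items())
               if not a & b)

def metaconflict(conflicts):
    """Metaconflict function (15.1): 1 - prod_p (1 - k_p)."""
    return 1.0 - prod(1.0 - k for k in conflicts)

# Assumed bba values (see the lead-in above).
m_T1 = {A1: 0.5, A2: 0.5, A3: 0.0}
m_T2 = {A1: 0.1, A2: 0.1, A3: 0.8}

def mcf_per_partition(m_Z):
    """Metaconflict of the two candidate partitions of {m_T1, m_T2, m_Z}.

    A singleton subset involves no combination, hence a conflict factor of 0.
    """
    return {
        "Z with T1": metaconflict([conflict(m_T1, m_Z), 0.0]),
        "Z with T2": metaconflict([0.0, conflict(m_T2, m_Z)]),
    }

problem1 = mcf_per_partition(m_T1)  # correct answer: Z with T1
problem2 = mcf_per_partition(m_T2)  # correct answer: Z with T2
decision1 = min(problem1, key=problem1.get)
decision2 = min(problem2, key=problem2.get)
print(problem1, decision1)
print(problem2, decision2)
```

With a single observation, one subset of each partition is a singleton, so the metaconflict reduces to the conflict factor of the other subset; this is why the criterion behaves like a plain minimum-conflict rule on these examples.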


From these very simple examples, it is interesting to note that Schubert's approach is actually exactly
equivalent (in these cases) to the min-conflict approach detailed in section 15.3.2 and thus will unfortunately
not provide better results. It is also possible to show that Schubert's approach also fails if one considers
jointly the two observed bba $m_{Z_1}(.)$ and $m_{Z_2}(.)$ corresponding to problems 1 and 2 with $m_{T_1}(.)$ and $m_{T_2}(.)$.
If one applies the principle of minimum metaconflict function, one will take the wrong decision since the
wrong partition $\{(Z_1, T_2), (Z_2, T_1)\}$ will be declared. This result is in contradiction with our intuitive
expectation for the true opposite partition $\{(Z_1, T_1), (Z_2, T_2)\}$ taking into account the coincidence of the
respective belief functions.


15.4 DSmT approaches for BAP 


As within DST, several approaches can be attempted to try to solve Blackman's Association Problems
(BAP). The first attempts are based on the minimum on $i$ of the new extended entropy-like measures
$H^*_{ext}(m_{T_iZ})$ or on the minimum of $H^*_{betP}(P^*)$. Both approaches actually fail for the same reason as
the DST-based minimum entropy criteria.


The second attempt is based on the minimum of variation of the new entropy-like measures as criterion
for the choice of the decision, with the new extended entropy-like measure:

$$\Delta_i(H^*_{ext}) = H^*_{ext}(m_{T_iZ}) - H^*_{ext}(m_{T_i})$$

or the new generalized pignistic entropy:

$$\Delta_i(H^*_{betP}) = H^*_{betP}(P^*|m_{T_iZ}) - H^*_{betP}(P^*|m_{T_i})$$

The minimum of $\Delta_i(H^*_{ext})$ gives us the wrong solution for problem 1 since $\Delta_1(H^*_{ext}) = 0.34657$ and
$\Delta_2(H^*_{ext}) = 0.30988$, while the minimum of $\Delta_i(H^*_{betP})$ gives us the correct solution since $\Delta_1(H^*_{betP}) = -0.3040$
and $\Delta_2(H^*_{betP}) = -0.0960$. Unfortunately, both the $\Delta_i(H^*_{ext})$ and $\Delta_i(H^*_{betP})$ criteria fail to provide
the correct solution for problem 2 since one gets $\Delta_1(H^*_{ext}) = 0.25577 < \Delta_2(H^*_{ext}) = 0.3273$ and
$\Delta_1(H^*_{betP}) = 0.0396 < \Delta_2(H^*_{betP}) = -0.00823$.


The third proposed approach is to use the criterion of the minimum of relative variations of pignistic
probabilities of $\theta_1$ and $\theta_2$ given by the minimum on $i$ of

$$\Delta_i(P^*) \triangleq \sum_{j=1}^{2} \frac{|P^*_{T_iZ}(\theta_j) - P^*_{T_i}(\theta_j)|}{P^*_{T_i}(\theta_j)}$$

This third approach fails to find the correct solution for problem 1 (since $\Delta_1(P^*) = 0.333 > \Delta_2(P^*) =
0.268$) but succeeds in getting the correct solution for problem 2 (since $\Delta_2(P^*) = 0.053 < \Delta_1(P^*) = 0.066$).
The last proposed approach is based on relative variations of pignistic probabilities conditioned by
the correct assignment. The criterion is defined as the minimum of

$$\delta_i(P^*) \triangleq \frac{|\Delta_i(P^*|Z) - \Delta_i(P^*|Z = T_i)|}{\Delta_i(P^*|Z = T_i)}$$

where $\Delta_i(P^*|Z = T_i)$ is obtained as for $\Delta_i(P^*)$ but by forcing $Z = T_i$ or equivalently $m_Z(.) = m_{T_i}(.)$ for
the derivation of the pignistic probabilities $P^*_{T_iZ}(\theta_j)$. This last criterion yields the correct solution for problem
1 (since $\delta_1(P^*) = |0.333 - 0.333|/0.333 = 0 < \delta_2(P^*) = |0.268 - 0.053|/0.053 \approx 4$) and simultaneously
for problem 2 (since $\delta_2(P^*) = |0.053 - 0.053|/0.053 = 0 < \delta_1(P^*) = |0.066 - 0.333|/0.333 \approx 0.8$).
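The figures used in these two criteria can be reproduced with the Python sketch below, which is an illustration under several assumptions not stated in this section: the bba values $m_{T_1} = (0.5, 0.5, 0)$ and $m_{T_2} = (0.1, 0.1, 0.8)$ over $(\theta_1, \theta_2, \theta_1 \cup \theta_2)$; the classic DSm rule on the free model, $m(C) = \sum_{A \cap B = C} m_1(A) m_2(B)$; and a generalized pignistic transformation of the form $P^*(\theta_j) = \sum_X \frac{C(X \cap \theta_j)}{C(X)} m(X)$, where $C(.)$ counts the parts of the Venn diagram covered by a proposition. Propositions are encoded as sets of Venn regions (`a`: $\theta_1$ only, `b`: $\theta_2$ only, `c`: $\theta_1 \cap \theta_2$); all names are hypothetical.

```python
from itertools import product

# Free DSm model for {theta1, theta2}, encoded by Venn-diagram regions.
TH1, TH2 = frozenset("ac"), frozenset("bc")
UNION = TH1 | TH2  # theta1 U theta2 covers regions a, b and c

def dsm_classic(m1, m2):
    """Classic DSm rule: conjunctive consensus, no normalization."""
    out = {}
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        out[a & b] = out.get(a & b, 0.0) + x * y
    return out

def gpt(m):
    """Assumed generalized pignistic transformation (see the lead-in)."""
    return {name: sum(v * len(x & th) / len(x) for x, v in m.items())
            for name, th in (("theta1", TH1), ("theta2", TH2))}

def delta(m_T, m_Z):
    """Relative variation Delta_i(P*) between P*_{T_i Z} and P*_{T_i}."""
    p_T, p_TZ = gpt(m_T), gpt(dsm_classic(m_T, m_Z))
    return sum(abs(p_TZ[t] - p_T[t]) / p_T[t] for t in p_T)

def small_delta(m_T, m_Z):
    """Criterion delta_i(P*): deviation from the value obtained with Z = T_i."""
    ref = delta(m_T, m_T)
    return abs(delta(m_T, m_Z) - ref) / ref

# Assumed bba values.
m_T1 = {TH1: 0.5, TH2: 0.5, UNION: 0.0}
m_T2 = {TH1: 0.1, TH2: 0.1, UNION: 0.8}

for label, m_Z in (("problem 1", m_T1), ("problem 2", m_T2)):
    print(label,
          "Delta:", delta(m_T1, m_Z), delta(m_T2, m_Z),
          "delta:", small_delta(m_T1, m_Z), small_delta(m_T2, m_Z))
```

On the free model every pairwise intersection contains the region `c`, so no mass is ever assigned to the empty set and the rule needs no normalization step.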


15.5 Monte-Carlo simulations 


As shown on the two previous BAP, it is difficult to find a general method for solving both these particular
(noise-free $m_Z$) BAP and all general problems involving a noisy attribute bba $m_Z(.)$. The proposed
methods have been examined only for the original BAP and no general conclusion can be drawn from our
previous analysis about the most efficient approach. The global performance/efficiency
of the previous approaches can however be estimated quite easily through Monte-Carlo simulations. Our
Monte-Carlo simulations are based on 50,000 independent runs and have been done both for the noise-free
case (where $m_Z(.)$ matches perfectly with either $m_{T_1}(.)$ or $m_{T_2}(.)$) and for two noisy cases (where
$m_Z(.)$ does not match perfectly one of the predicted bba). Two noise levels (low and medium) have been
tested for the noisy cases. A basic run consists in randomly generating the two predicted bba $m_{T_1}(.)$ and
$m_{T_2}(.)$ and an observed bba $m_Z(.)$ according to a random assignment $m_Z(.) \leftrightarrow m_{T_1}(.)$ or $m_Z(.) \leftrightarrow m_{T_2}(.)$.
Then we evaluate the percentage of right assignments for all the association criteria described in
this chapter. The introduction of noise on the perfect (noise-free) observation $m_Z(.)$ has been obtained by
the following procedure (with notation $A_1 \triangleq \theta_1$, $A_2 \triangleq \theta_2$ and $A_3 \triangleq \theta_1 \cup \theta_2$):

$$m_Z^{noisy}(A_i) = \alpha_i m_Z(A_i)/K$$

where $K$ is a normalization constant such that $\sum_{i=1}^{3} m_Z^{noisy}(A_i) = 1$ and the weighting coefficients $\alpha_i \in [0; 1]$
are given by $\alpha_i = 1/3 + \epsilon_i$ such that $\sum_{i=1}^{3} \alpha_i = 1$.
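The noise procedure can be sketched in a few lines of Python. This is not the simulation code behind the reported results: how the $\epsilon_i$ are drawn for the low and medium noise levels is not specified here, so the sketch simply treats them as signed offsets that sum to zero (an assumption), and the function name `add_noise` and the example values are hypothetical.

```python
def add_noise(m_Z, eps):
    """Return the noisy bba alpha_i * m_Z(A_i) / K with alpha_i = 1/3 + eps_i.

    m_Z and eps are sequences ordered as (theta1, theta2, theta1 U theta2).
    The offsets eps are assumed to sum to zero so that the alpha_i sum to 1.
    """
    if abs(sum(eps)) > 1e-12:
        raise ValueError("offsets must sum to zero so that the alpha_i sum to 1")
    alpha = [1.0 / 3.0 + e for e in eps]
    if any(a < 0.0 or a > 1.0 for a in alpha):
        raise ValueError("each alpha_i must lie in [0, 1]")
    weighted = [a * m for a, m in zip(alpha, m_Z)]
    K = sum(weighted)  # normalization constant
    if K == 0.0:
        raise ValueError("all the mass has been weighted out")
    return [w / K for w in weighted]

# Hypothetical example: perturb an observation (0.1, 0.1, 0.8).
m_Z = [0.1, 0.1, 0.8]
noisy = add_noise(m_Z, [0.10, -0.05, -0.05])
unchanged = add_noise(m_Z, [0.0, 0.0, 0.0])
print(noisy, sum(noisy))
```

With all offsets equal to zero the weights are uniform and the normalization returns the original bba, which corresponds to the noise-free case.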


Table 1 shows the Monte-Carlo results obtained with all investigated criteria for the following
3 cases: noise-free (NF), low noise (LN) and medium noise (MN) related to the observed bba $m_Z(.)$.
The first two rows of the table correspond to the simplest approach. The next twelve rows correspond to
DST-based approaches.

Table 1: % of success of association methods (only the following row labels are recoverable; the NF, LN and MN columns are missing)

Min $L(Z|T_i)$
Min $d_{L^1}(T_i, T_iZ)$
Min $d_{L^2}(T_i, T_iZ)$
Min $H_{ext}(m_{T_iZ})$
Min $H_{gen}(m_{T_iZ})$
Min $H_{betP}(m_{T_iZ})$
Min $\Delta_i(H_{ext})$
Min $\Delta_i(H_{gen})$
Min $\Delta_i(H_{betP})$
Min $\Delta_i(P)$
Min $Mcf_i$
Table 2 shows the Monte-Carlo results obtained for the 3 cases: noise-free (NF), low noise (LN)
and medium noise (MN) related to the observed bba $m_Z(.)$ with the DSmT-based approaches.

Table 2: % of success of DSmT-based methods (only the following row labels are recoverable; the NF, LN and MN columns are missing)

Min $H^*_{ext}(m_{T_iZ})$
Min $H^*_{betP}(P^*)$
Min $\Delta_i(H^*_{ext})$
Min $\Delta_i(H^*_{betP})$
Min $\Delta_i(P^*)$
Min $\delta_i(P^*)$


15.6 Conclusion 


A new examination of Blackman's association problem has been presented in this chapter. Several
methods have been proposed and compared through Monte Carlo simulations. Our results indicate that
the commonly used min-conflict method does not provide the best performance in general (especially w.r.t.
the simplest distance approach). Thus the metaconflict approach, equivalent here to min-conflict, does
not achieve optimal efficiency. Blackman's approach and min-conflict give the same performance.
All entropy-based methods are less efficient than the min-conflict approach. More interestingly, from the
results based on the generalized pignistic entropy approach, the entropy-based methods seem actually
not appropriate for solving BAP since there is no fundamental reason to justify them. The min-distance
approach of Tchamova is the least efficient method among all methods when abandoning entropy-based
methods. Monte Carlo simulations have shown that only methods based on the relative variations of
generalized pignistic probabilities built from the DSmT (and the free DSm model) outperform all methods
examined in this work except the simplest one. Analyses based on the DSmT and the hybrid DSm rule of
combination are under investigation.
Abstract: In situation analysis, an agent observing a scene receives information
from heterogeneous sources of information including for example remote sensing devices,
human reports and databases. The aim of this agent is to reach a certain
level of awareness of the situation in order to make decisions. For the purpose of
applications, this state of awareness can be conceived as a state of knowledge in the
classical epistemic logic sense. Considering the logical connection between belief and
knowledge, the challenge for the designer is to transform the raw, imprecise, conflictual
and often paradoxical information received from the different sources into
statements understandable by both man and machines. Situation analysis applications
need frameworks general enough to take into account the different types of
uncertainty and information present in the situation analysis context, doubled with a
semantics allowing meaningful reasoning on situations. The aim of this chapter is to
evaluate the capacity of neutrosophic logic and Dezert-Smarandache theory (DSmT)
to cope with the ontological and epistemic problems of situation analysis.
16.1 Introduction


The aim of Situation Analysis (SA) in a decision-making process is to provide and maintain a state of
situation awareness for an agent observing a scene [1] [2]. For the purpose of applications, this state
of awareness can be conceived as a state of knowledge in the classical epistemic logic sense. Considering 
the logical connection between belief and knowledge, the challenge for the designer is to transform the 
raw, imprecise, conflictual and often paradoxical information received from the different sources into 
statements understandable by both man and machines. Because the agent receives information from 
heterogeneous sources of information including for example remote sensing devices, human reports and 
databases, two simultaneous tasks need to be achieved: measuring the world and reasoning about the 
structure of the world. A great challenge in SA is the conciliation of both quantitative and qualitative 
information processing in mathematical and logical frameworks. As a consequence, SA applications 
need frameworks general enough to take into account the different types of uncertainty and information 
present in the SA context, doubled with a semantics allowing meaningful reasoning on belief, knowledge 
and situations. The formalism should also allow the possibility to encompass the case of multiagent 
systems in which the state of awareness can be distributed over several agents rather than localized. 

A logical approach based on a possible worlds semantics for reasoning on belief and knowledge in 
a multiagent context is proposed in [3]. This work by Halpern and Moses can be used as a blueprint,
considering that it allows handling numerical evaluations of probabilities, thus treating separately but
nevertheless linking belief, knowledge and uncertainty. Related works are those of Fagin and Halpern [4]
but also Bundy [5] which extend the probability structure of Nilsson [6] based on possible worlds semantics 
to a more general one close to the evidence theory developed by Dempster [7] and Shafer [8]. The result 
is the conciliation of both measures and reasoning in a single framework. 

Independently of these works, Neutrosophy has been introduced as a branch of philosophy which studies
neutralities and paradoxes, and relations between a concept and its opposite [9]. Two main formal
approaches have emerged from Neutrosophy: neutrosophic logic, presented as a unified logic, of which
fuzzy logic, classical logic and others are special cases [10] [11]; and Dezert-Smarandache theory (DSmT)
that can be interpreted as a generalization of Dempster-Shafer theory. On one hand, neutrosophic logic
appears as an interesting avenue for SA because (1) indeterminacy is explicitly represented by means
of an indeterminacy assignment, (2) falsity, truth and indeterminacy are represented independently (three
distinct assignments), (3) it is a quantified logic, meaning that numerical evaluations of truth, falsity and
indeterminacy values are allowed, (4) this quantification is allowed on hyperreal intervals, a generalization
of intervals of real numbers giving a broader frame for interpretations, (5) many novel connectives are
defined (Neut-A, Anti-A, ...). On the other hand, being built on the hyper-power set of the universe of
discourse, the DSmT allows taking into account the indeterminacy linked to the very definition of the
individual elements of the universe of discourse, relaxing the mutual exclusivity hypothesis imposed by
the Dempster-Shafer theory (DST). This framework thus extends the DST by allowing a wider variety
of events to be considered when measures become available. Indeed, a particularity of SA is that most
of the time it is impossible beforehand to list every possible situation that can occur. The elements of
the corresponding universe of discourse cannot, thus, be considered as an exhaustive list of situations.
Furthermore, in SA situations are not clearcut elements of the universe of discourse.

The aim of this chapter is to evaluate the potential of neutrosophic logic and Dezert-Smarandache
theory (DSmT) to cope with the ontological and epistemic obstacles in SA (section 16.3), i.e. problems
due to the nature of things and to cognitive limitations of the agents, human or artificial. Section
16.4 exposes four basic principles guiding SA systems design in practice, and highlights the capacity of
both neutrosophic logic and DSmT to cope with these principles. After brief formal descriptions of
neutrosophic logic and DSmT (section 16.5), we propose in section 16.6 different extensions based on
Kripke structures and Dempster-Shafer structures. In particular, a Kripke structure for neutrosophic
propositions is presented in section 16.6.2. In the latter section, we assess the ability of neutrosophic
logic to process symbolic and numerical statements on belief and knowledge using the possible worlds
semantics. Moreover, we investigate the representation of neutrosophic concepts of neutrality and opposite
in the possible worlds semantics for situation modelization. In section 16.6.3, after introducing Nilsson
and Dempster-Shafer structures, we present a possible extension to DSmT. We also propose an example
to illustrate the benefit of using a richer universe of discourse, and thus how DSmT appears as an
appropriate modelling tool for uncertainty in SA. We then propose a possible connection between DSmT
and neutrosophic logic in the Kripke structures setting (section 16.6.4). Finally, in section 16.7, we
conclude on possible research avenues for using DSmT and neutrosophic logic in SA.


16.2 Situation analysis 


The term situation appears in the mid-fourteenth century derived from medieval Latin situatio meaning 
being placed into a certain location. By the middle of the seventeenth century situation is used to discuss 
the moral dispositions of a person, more specifically the set of circumstances a person lies in, the relations 
linking this person to its milieu or surrounding environment. As will be shown below, the latter definition 
is close to what is meant today in the field of High-Level Data Fusion, where the mental state of situation 
awareness is studied in interaction with the surrounding environment. Common synonyms of situation 
with a corresponding meaning are setting, case, circumstances, condition, plight, scenario, state, picture, 
state of affairs. 

Although the notion of situation is used informally in everyday language to designate a given state
of affairs, a simplified view of the world, and even the position of certain objects, situation is nowadays
a central concept in High-Level Data Fusion where it has been given more or less formal definitions. For
Pew [12], a situation is “a set of environmental conditions and system states with which the participant is
interacting that can be characterized uniquely by a set of information, knowledge, and response options”.


16.2.1 Situation awareness as a mental state 


For Endsley and Garland [1], Situation awareness (SAW) is “the perception of the elements in the environment
within a volume of time and space, the comprehension of their meaning and the projection of their
status in the near future”. SAW is also defined in [13] as “the active mental representation of the status
of current cognitive functions activated in the cognitive system in the context of achieving the goals of a
specific task”. In particular, SAW involves three key tasks: (1) Perception, (2) Comprehension and (3)
Projection, in a general multiagent context (Fig. 16.1).









Figure 16.1: The three basic processes of situation awareness according to Endsley and Garland (modified
from [1]), in a multiagent context: Perception of elements in current situation, Comprehension of current
situation, Projection of future status.


In contemporary cognitive science the concept of mental representation is used to study the interface 
between the external world and mind. Mental states are seen as relations between agents and mental 
representations. Formally, and following Pitt’s formulation [14], for an agent to be in a psychological state 
W with semantic property I is for that agent to be in a W-appropriate relation to a mental representation 
of an appropriate kind with semantic property I. As far as mental states are concerned, purely syntactic 
approaches are not adequate for representation since semantic concepts need to be modeled. 

Explicit reasoning on knowledge and the problems linked to its representation are distinctive features of
situation analysis. Our position is to refer to the sources of knowledge usually considered in epistemology,
namely, Perception, Memory, Reasoning, Testimony and Consciousness [15], and extend Endsley's model
of situation awareness [1] where perception appears as the only source of knowledge.
16.2.2 Situation Analysis as a process


For Roy [2] “Situation Analysis is a process, the examination of a situation, its elements, and their relations,
to provide and maintain a product, i.e. a state of Situation Awareness (SAW) for the decision
maker”. For a given situation the SA process creates and maintains a mental representation of the situation.
Situation analysis corresponds to levels 2, 3 and 4 of the JDL data fusion model [16][17], hence to
the higher levels of data fusion. A revisited version of the well-known model is presented in figure 16.2 with
classical applications associated with the different levels. A complete situation model must take into account
the following tasks:

A. Situation perception, composed of Situation Element Acquisition, Common Referencing, Perception
Origin Uncertainty Management, and Situation Element Perception Refinement as subtasks.

B. Situation comprehension, composed of Situation Element Contextual Analysis, Situation Element
Interpretation, Situation Classification, Situation Recognition, and Situation Assessment as subtasks.

C. Situation projection, composed of Situation Element Projection, Impact Assessment, Situation
Monitoring, Situation Watch, and Process Refinement [2].

Figure 16.2: Revisited JDL data fusion model and applications [18].

The conception of a system for SA must rely on a mathematical and/or logical formalism capable of
translating the mechanisms of the SAW process at the human level. The formalism should also allow the
possibility to encompass the case of multiagent systems in which the state of awareness can be distributed
over several agents rather than localized. A logical approach based on a possible worlds semantics for
reasoning on belief and knowledge is proposed in [3]. This work by Halpern and Moses can be used
as a blueprint considering that it allows handling numerical evaluations of probabilities, thus treating
separately but nevertheless linking belief, knowledge and uncertainty.

Furthermore, mathematical and logical frameworks used to model mental states should be able to
represent and process autoreference such as beliefs about one's own beliefs, beliefs about beliefs about
... and so on.


16.2.3 A general model of a distributed system 


In 1990, Halpern and Moses proposed a model of distributed knowledge processing [3] that can be used 
for the purpose of situation analysis, as stated above. Short definitions are given below for the different 


components of the model: 


• A distributed system is a finite collection of two or more interacting agents $A_1, \ldots, A_n$ (connected
by a communication network);

• The local state of an agent is determined by the encapsulation of all the information an agent
has access to at a given instant;

• The state of the environment is defined as the information relevant to the system but not
contained in the state of the agents;

• The global state of a system is given by the sum of the agents' local states together with the state
of the environment;

• A run is a function from time to global states;

• A point is a pair $(r, m)$ consisting of a run $r$ and a time $m$;

• A system is defined as a set of runs. A system can also be viewed as a Kripke structure supplemented
with a way to assign truth values.
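The components listed above map naturally onto simple data structures. The Python sketch below is a toy illustration, not the formalism of [3]: runs are restricted to a finite horizon and stored as tuples of global states, and the `knows` test uses the usual possible-worlds reading (an agent knows a fact at a point if the fact holds at every point where its local state is the same), which is an addition to the definitions given here. All class, field and example names are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Hashable, List, Tuple

@dataclass(frozen=True)
class GlobalState:
    """Agents' local states together with the state of the environment."""
    local_states: Tuple[Hashable, ...]  # one entry per agent A_1..A_n
    environment: Hashable

# A finite-horizon run: run[m] is the global state at time m.
Run = Tuple[GlobalState, ...]
# A point (r, m): index r of a run in the system and a time m.
Point = Tuple[int, int]

@dataclass
class System:
    """A system is a set of runs (stored here as a list)."""
    runs: List[Run]

    def points(self) -> List[Point]:
        return [(r, m) for r, run in enumerate(self.runs) for m in range(len(run))]

    def state(self, point: Point) -> GlobalState:
        r, m = point
        return self.runs[r][m]

    def indistinguishable(self, agent: int, point: Point) -> List[Point]:
        """Points at which the agent has the same local state as at `point`."""
        local = self.state(point).local_states[agent]
        return [p for p in self.points()
                if self.state(p).local_states[agent] == local]

    def knows(self, agent: int, point: Point,
              fact: Callable[[GlobalState], bool]) -> bool:
        """The fact holds at every point the agent cannot tell apart."""
        return all(fact(self.state(p))
                   for p in self.indistinguishable(agent, point))

# Toy example: agent 0 observes the weather, agent 1 does not.
run_a = (GlobalState(("sees_rain", "blind"), "rain"),)
run_b = (GlobalState(("sees_dry", "blind"), "dry"),)
system = System([run_a, run_b])
is_rain = lambda g: g.environment == "rain"
print(system.knows(0, (0, 0), is_rain), system.knows(1, (0, 0), is_rain))
```

In this toy system the first agent knows that it rains at the point (0, 0) while the second does not, because its local state is identical in both runs.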


This model is illustrated in figure 16.3 and appears as a sufficient basis for defining the basic concepts of
situation analysis. Indeed, the local state of an agent $A_i$ can also be called its Knowledge-Base (denoted
by $KB_i$) upon which an awareness function delimits subsets, the latter being particular views of a
given situation (see section 16.4.2 on contextualization). From an algebraic point of view, the same agent
can generate different views of the same situation, either disjoint, overlapping or nested.


16.3 Sources of uncertainty in Situation Analysis 


Situation analysis is experimental by nature. A major obstacle encountered in the process lies in the
ubiquity of uncertainty. While in a previous paper [19] we highlighted four main facets of uncertainty,
(1) Meaning (mental state or property of the information), (2) Interpretation (objective or subjective),
(3) Types (fuzziness, non-specificity and discord) and (4) Mathematical representations (quantitative
vs. qualitative approaches), in this section we rather review the potential sources of uncertainty and
obstacles arising in a situation analysis context.

Figure 16.3: The general model of a distributed system proposed by Halpern and Moses in [3] adapted
for situation representation (labels: Situation representations, State of the environment).

Uncertainty has two main meanings in most of the classical dictionaries [19]: uncertainty as a state
of mind and uncertainty as a physical property of information. The first meaning refers to the state of
mind of an agent, which does not possess the needed information or knowledge to make a decision; the
agent is in a state of uncertainty: “I'm not sure that this object is a table”. The second meaning refers
to a physical property, representing the limitation of perception systems: “The length of this table is
uncertain” (given the measurement device used).

Sociologists like Gérald Bronner [20] consider uncertainty as a state of mind, this state depending on
our power on the uncertainty, and our capacity to avoid it. He distinguishes two types of uncertainty:
uncertainty in finality (or material uncertainty) and uncertainty of sense. Uncertainty in finality is “the
state of an individual that, wanting to fulfill a desire, is confronted with the open field of the possibles”
(“Will my car start?”), whereas uncertainty of sense is “the state of an individual when a part, or the
whole of its systems of representation is deteriorated or can be”. Uncertainty in finality corresponds to
the uncertainty in which lies our understanding of the world, while uncertainty of sense bears on the
representation of the world. Bronner identifies three types of uncertainty in finality, according to one's
power on uncertainty, and the capacity to avoid it:

• Situation of type I: Uncertainty does not depend on the agent and cannot be avoided;

• Situation of type II: Uncertainty does not depend on the agent but can be avoided;

• Situation of type III: Uncertainty is generated by the agent and can be avoided.


In situation analysis, agents are confronted with uncertainty of sense (data driven) from the bottom-up
perspective and with uncertainty in finality (goal driven) from the top-down perspective. It follows that
there are two kinds of limits to state estimation and prediction in Situation Analysis:

1. Ontological limits due to the nature of things and

2. Epistemic limits due to cognitive limitations of the agents, human or artificial.


Typical obstacles [21] are anarchy and instability, when the situation is not governed by an identifiable 
law or in the absence of nomic stability. Chance and chaos are serious obstacles to state evaluation and 
prediction as far as an exact estimation is sought, even though regularities and determinism are observed. 
Another typical obstacle is the vagueness of concepts. Natural language concepts are inherently vague, 
meaning that their definition is approximate and borderline cases arise. This is true for properties as 
well as for concepts. 

Indeterminacy is another unavoidable obstacle. It may arise from paradoxical conclusions to a given 
inference (e.g. Russell’s paradox or the sorites paradox), from impossible physical measurements (e.g. the 
position and speed of an atomic particle) or for practical reasons (e.g. NP-complete problems). From a given 
theoretical standpoint (classical vs. quantum mechanics), indeterminacy may nevertheless be proposed 
as a conclusion to specific unanswerable questions, so as to allow reasoning using the remaining 
information. 

Ignorance of the underlying laws governing the situation is a major cause of uncertainty. For example, 
not knowing that a given tactical maneuver is possible precludes the possibility of predicting its occurrence. 
Especially present in human affairs, innovation can be a major obstacle in SA. New kinds of objects 
(weapons), processes (courses of action) or ideas (doctrines) arise, and one has no choice but to deal with 
them and adapt. 

Myopia, or data ignorance, is also a typical problem in SA. Data must be available on time in order 
to assess a situation, meaning that even if the information sources exist, circumstances can prevent their 
delivery. Another case of myopia occurs when data is not available in sufficient detail, as in pattern 
recognition when classes are only coarsely defined or when sensors have limited spatial resolution. Data is 
thus accessible through estimations obtained by sampling as in surveys, by the computation of aggregates 
as in Data Fusion or by the modelling of rough estimates. As a consequence, the available data is only 
imprecise and incomplete and leads most of the time to conflicting choices of decision. Major tasks of 
SA are change detection and failure prediction. 

Any attempt at designing a system is bounded by the inferential incapacity of human or 
artificial agents. Limitations in agents can arise because of a lack of awareness. As far as knowledge is 
concerned, an agent cannot always give a value to a proposition, for example if it is not even aware of the 
existence of the concept denoted by the proposition at hand. Agents are resource bounded, meaning that 
they have only limited memorization capabilities, limited cognitive and computational capabilities and, 
in some cases, power supply limitations. Agents may also have limited visual or 
auditory acuity. Sometimes these limitations come from the outside and are situation driven: electronic 
countermeasures, a limited amount of time or money available to do the job, etc. Furthermore, 
agents cannot focus on all issues simultaneously. As Fagin and Halpern put it in [22], “[...] Even if A 
does perfect reasoning with respect to the limited number of issues on which he is focusing in any given 
frame of mind, he may not put his conclusions together. Indeed, although in each frame of mind agent 
A may be consistent, the conclusions A draws in different frames of mind may be inconsistent.” Finally, 
agents must work with an inconsistent set of beliefs. For example, we know that lying is immoral, but in 
some cases we admit it could be a good alternative to a crisis. 


16.4 Ontological principles in Situation Analysis 


Given the limitations and the sources of uncertainty involved in Situation Analysis (section 16.3), we 
state in this section four main ontological principles that should guide SA systems design in practice: (1) 
allowing statements and reasoning about uncertainty to be made, (2) contextualization, (3) enrichment 
of the universe of discourse, and (4) allowing autoreference. 


16.4.1 Allowing statements and reasoning about uncertainty 


We begin with two observations that will guide the discussion of this section: 


1. Many concepts are linked to uncertainty: vagueness, indeterminacy, truth, belief, indiscernibility, 
ambiguity, non-specificity, incompleteness and imprecision, to name a few. Although these concepts are 
a priori distinct, it is common to confuse them and to be unable to talk about one without any 
reference to the others. The recent development of new theories of uncertainty aims at separating 
these aspects and bringing clarification in this direction, as is the case for probability theory and 
fuzzy logic. Another contribution in this direction is the axiomatization proposed by Fagin and 
Halpern in [4], which provides a semantic structure for reasoning about both belief and probability, 
thus distinguishing these two often confused concepts. 

2. Although it is possible to deal with uncertainty in general using purely qualitative notions, the 
mixture of discrete and continuous objects composing the world has led to the introduction of degrees. 


In a very general sense, as written in the previous section (section 16.3), uncertainty is often seen as the 
result of indeterminacy. As far as formalization is concerned, the classical means of reasoning soon exposed 
their limitations. Propositional Calculus (PC) relies on the principle of bivalence, expressing the fact that 
a proposition is either TRUE or FALSE. Hence, only two truth values are allowed, leaving no way to 
express indeterminacy. The most common way to go beyond bivalence is to introduce supplementary truth 
values in the PC framework. The meaning of the supplementary truth value differs from one author 
to another, from one logic to another. However, it is common to denote truth, falsity and indeterminacy 
by 1, 0 and 1/2 respectively. 

Here the problem of the meaning of the uncertainty arises. To a given type of uncertainty (contingent 
future events, indetermination, etc.) corresponds a particular interpretation of the set of connectives. While 
Łukasiewicz was primarily interested in the problem of contingent future events or possibility, Kleene in 
1938 [23] proposed a three-valued logic used in recursion theory in order to design stopping criteria and allow 
for the indeterminacy of some propositions. Bochvar (1938) [24] proposed a logic classifying propositions as 
sensible or senseless. For him, true and false propositions are meaningful, while the third truth value designates 
meaningless or paradoxical propositions. Bochvar’s system of logic was later rediscovered by Halldén 
in 1949 [25] and used to process vague and nonsensical propositions. In fact, the different meanings of 
uncertainty are translated into the particular definitions given to logical connectives, with respect to the common 
intuition of the terms at hand. 
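
The dependence of the connectives on the intended meaning of the third value can be made concrete with a small sketch. It assumes the usual textbook definitions (Kleene's strong connectives, the Łukasiewicz implication and Bochvar's "internal" conjunction) and encodes the truth values as 1, 0 and 1/2; the function names are illustrative and do not come from the chapter.

```python
from fractions import Fraction

# Truth values: true, false, and the supplementary (indeterminate) value.
T, F, U = Fraction(1), Fraction(0), Fraction(1, 2)

def neg(a):
    """Negation shared by the three logics: swaps 1 and 0, leaves 1/2 fixed."""
    return 1 - a

def kleene_and(a, b):
    """Kleene (strong) conjunction: a false conjunct settles the result."""
    return min(a, b)

def kleene_implies(a, b):
    """Kleene implication, read as (not a) or b."""
    return max(1 - a, b)

def lukasiewicz_implies(a, b):
    """Lukasiewicz implication: min(1, 1 - a + b)."""
    return min(Fraction(1), 1 - a + b)

def bochvar_and(a, b):
    """Bochvar (internal) conjunction: a meaningless part makes the whole meaningless."""
    return U if U in (a, b) else min(a, b)

print(kleene_and(F, U))           # 0
print(bochvar_and(F, U))          # 1/2
print(kleene_implies(U, U))       # 1/2
print(lukasiewicz_implies(U, U))  # 1
```

The same inputs give different outputs: "false and indeterminate" is false for Kleene but senseless for Bochvar, and "indeterminate implies indeterminate" is true only for Łukasiewicz.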

It is important to note that in general the truth values are not ordered and, just like in PC, the truth 
values are purely conventional. In this sense, the so-called values of the truth tables can be considered 
qualitative (see Fig. 16.4(a)). However, these three truth values can also be ordered, then representing 
a rough quantitative description of the world (see Fig. 16.4(b)). But intuition also tells us that things 
are not always clear cut in the real world and rather appear in shades of gray. A three-valued logic can 
be generalized to an n-valued logic and, by extension, to fuzzy logic with an infinite number of truth 
values ranging over the real interval [0, 1]. Such an extension thus introduces an order between truth 
statements (see Fig. 16.4(c)). Another consequence of this extension is that the notion of uncertainty 
is now expressed explicitly in terms of truth or falsity. While in a three-valued logic indeterminacy, 
possibility or vagueness are expressed as neither TRUE nor FALSE, in Łukasiewicz’s logic or, to take 
a more recent example, in fuzzy logic, the uncertainty is expressed by an explicit reference to truth or falsity. 

The introduction of degrees then imposes an order between values. Truth then becomes a kind of 
falsity and vice versa, and the qualitative aspect of the three initial truth values is lost, together with their 
independence. Yet another extension, which reconciles both qualitative and quantitative aspects of indeterminacy, 
is to consider different independent aspects of uncertainty and represent them on independent axes. This 
is the principle developed by Smarandache in neutrosophic logic [10] [11], where the considered 
aspects of uncertainty are truth, falsity and indeterminacy (see Fig. 16.4(d)). Hence, in neutrosophic logic 
both the qualitative aspect of non-ordered three-valued logics and the quantitative aspect of fuzzy logic 
are combined. One main benefit of neutrosophic logic is that indeterminacy can be addressed in two 
different manners: (1) using the indeterminacy function independently¹ of the truth and falsity functions, 
or (2) using the three previous functions, as is commonly done in fuzzy logic. Moreover, because of the 
assumed independence of the three concepts of truth, falsity and indeterminacy, NL is able to represent 
paradoxes, for example something that is completely true, completely false and completely indeterminate. 
Neutrosophy and neutrosophic logic are introduced in sections 16.5.1 and 16.5.2, respectively. 

Figure 16.4: Supplementary truth values for representing indeterminacy. 

¹ Note however that although truth, falsity and indeterminacy are considered independently in NL, the use of the 
hyperreals is a means to make them dependent. Indeed, an absolutely TRUE proposition ($T(\phi) = 1^{+}$) is also absolutely 
FALSE ($F(\phi) = {}^{-}0$). This condition is not required for relatively TRUE propositions ($T(\phi) = 1$) [10]. 

Finally, we recall that although indeterminacy has been discussed from a logical point of view, 
it is also represented in more quantitative approaches. Indeed, in probability theory, assigning 
a probability value in ]0; 1[ to an event expresses the indeterminate state of this event. It has nothing to 
do with the truth of the event, but rather with its potential occurrence. By extension, Dempster-Shafer 
theory, possibility theory and Dezert-Smarandache theory are other numerical approaches to deal with 
indeterminacy. Some of these approaches are briefly discussed in section 16.5.3. 


16.4.2 Contextualization 


In SA, the operation of contextualization serves many purposes and is at the basis of the abstract notion 
of situation itself as it is understood by defence scientists, software engineers and commanding officers 
as well. According to Theodorakis [26], in the context of information modelling, “a context is viewed as 
a reference environment relatively to which descriptions of real world objects are given. The notion of 
context may be used to represent real world partitions, divisions, or in general, groups of information, 
such as situations, viewpoints, workspaces, or versions”. In this sense a context is a mental, thus partial, 
representation of a real situation. For Theodorakis “A situation records the state of the world as it 
is, independently of how it is represented in the mind of an agent. A situation is complete as it records 
all the state of the world. Whereas, contexts are partial as they represent situations and hence capture 
different perspectives or record different levels of detail of a particular situation”. 

For Brézillon [27] a context can be “a set of preferences and/or beliefs, a window on a screen, an infinite 
and only partially known collection of assumptions, a list of attributes, the product of an interpretation, a 
collection of context schemata, paths in information retrieval, slots in object-oriented languages, buttons 
which are functional, customizable and shareable, possible worlds, assumptions under which a statement 
is true or false, a special, buffer-like data structure, an interpreter which controls the system’s activity, 
the characteristics of the situation and the goals of the knowledge use, entities (things or events) related 
in a certain way, the possibility that permits to listen what is said and what is not said”. 

Contextualization is an operation largely applied in artificial intelligence, natural language processing, 
databases and ontologies, communication, electronic documentation and machine vision. The principal 
benefits from contextualization are the modularity of representation, context dependent semantics, and 
focused information access [27]. As far as SA is concerned, a context or, if one prefers, a representation 
of a situation is a means to encapsulate information while eliminating the unnecessary details. It makes 
it possible to refer to a given representation of the world while allowing different interpretations of the 
meaning of this precise representation, and finally gives access to a mechanism to focus on details when 
required. 

Using the notation defined earlier (section 16.2.3), a context or a situation s is a view on the global 
state of an agent A built on a given database KB. This view can be shared by multiple agents through 
communication links. As will be shown below, contexts are a means to make reasoning local, allowing for 
example an agent to hold incoherent beliefs or to deal with incomplete information and knowledge. 


Contextualizations are usually based on criteria such as 

• time: limits due to real time applications requirements or planning objectives, 
• space: limits due to range of sensors or territorial frontiers, 
• function: discrimination according to objects’ functions or agents’ social roles, 
• structure: distinction between cooperative or egoistic behavior. 


Agents performing situation analysis are embedded in complex and dynamically changing environments. 
Many problems arise (1) from the unpredictability and instability of such environments, (2) from 
the particularities of the SA tasks to accomplish and finally (3) from the agents’ own limitations, both 
physical and mental. 


1. The unpredictability and instability of the environment will force the agent to concentrate on the 
most certain information available and leave unmeasured the events that are not yet accessible. 

In this case, the result of contextualization is for example the constitution of the σ-algebra used in 
probability theory (see section 16.5.3). Similarly, the generic operation consisting in the specification 
of upper and lower bounds over sets of events is also a form of contextualization. This operation 
is present in different theories such as Dempster-Shafer theory (belief and plausibility measures, or 
lower and upper probabilities) and rough set theory (lower and upper approximations). 


2. Depending on the complexity of the environment, the different tasks involved in SA will not require 
the same level of attention or the same depth of reasoning, nor be subject to the same reaction 
delays. Consequently, the agents will only consider limited time and space frames in order to 
efficiently answer operational requirements. These limits are imposed voluntarily by designers of 
SA systems, implemented by experienced game players, but are also innate to many biological 
systems. 

Two models have been proposed for the partition of sets of possible worlds (see section 16.6.1): 
the Rantala and sieve models. Rantala models [28] are a modification of the standard Kripke 
model semantics that incorporates the notion of impossible worlds, allowing them to be distinguished 
from possible worlds. In these impossible worlds anything can hold, even contradictions. The 
notion captures the fact that a non-ideal agent may believe in things that are inconsistent, false, 
etc., but are nonetheless considered as epistemic alternatives. Sieve models were proposed by 
Fagin and Halpern in 1988 [22] in order to prevent the problem of omniscience by introducing a 
function that acts as a sieve. Instead of introducing nonstandard worlds or situations, sieve models 
introduce a segregation between formulas that can be known or believed and others that cannot. The 
sieve function indicates in fact whether the agent is aware of a given formula in a given situation. Being 
aware amounts to knowing or believing the formula in question. 


3. It is a common practice in SA to consider resource bounded agents, even implicitly. In economics the 
notion of unbounded rationality refers to the consideration of all possible alternatives and the choice 
of the best one, often using optimization techniques. The opposite view of rational choice theory, 
bounded rationality, rather considers that there are finite limits to the information and calculations 
a human brain or a mechanical memory device can handle (e.g. Bremermann’s computational limit). 
This view also holds that deliberation costs should be included in models, further limiting 
rationality for the sake of economy. 


According to many authors [29] [30] [31], in neutrosophy the attribution of truth values can be bound to 
specific circumstances, making it thus a contextual theory of truth [32]. Unary neutrosophic connectives 
such as A′, Anti-A and Neut-A (see section 16.5.1) seem particularly interesting for the manipulation of 
contextual concepts. 


16.4.3 Enrichment of the universe of discourse 


The universe of discourse is the set of objects (concrete or abstract) considered in a given context. It 
could be a set of classes, a set of targets, a set of actions to take, etc., but also a set of possible worlds 
(i.e. of possible states of the world). Let $S$ represent the universe of discourse, the set of all possible 
outcomes of an experiment: 

$$S = \{s_1, s_2, \ldots, s_n\} \tag{16.1}$$


The universe of discourse is, in a sense, the result of a contextualization operation (section 16.4.2), since 
not all objects existing in the world are present in this set; a choice has been made (voluntarily or not). 
It is then the support for a problem-solving situation and represents the objects about which we are able 
to talk. 

However, it represents an ideal model assuming a perfect description. Unfortunately, the real world is 
often different and more complex than expected. Indeed, on the one hand the agents have a limited access 
to knowledge and, on the other hand, objects in the real world itself are not clear cut and a perfect 
description is in general impossible. These features of reality cannot in general be taken into account in 
the modelling of the problem (i.e. in the definition of the universe of discourse). Hence, a solution to 
deal with the two different kinds of limitations we face in SA, epistemic limitation (due to cognitive 
limitations of the agents, human or artificial) and ontological limitation (due to the nature of things) 
(section 16.3), is to artificially enrich the universe of discourse. 

1. The failure of the sources of knowledge of an agent leads mainly to indiscernibility (see section 
16.3). Indeed, an epistemic limitation implies the necessity of considering other objects than those 
originally present in $S$. In particular, the incapacity of an agent to distinguish between two objects 
$s_1$ and $s_2$ at a given time, in a given context, is represented by $s_1 \cup s_2$, which is another object, built 
from $S$ but not explicitly in $S$. $s_1 \cup s_2$ is then the best answer the agent can give at a given time, 
even if it knows that the answer is either $s_1$ or $s_2$. 

In probability theory, because of the axiom of additivity, we cannot refer to $s_1 \cup s_2$ independently of 
the rest of the universe. Indeed, $\mu(s_1 \cup s_2) = \mu(s_1) + \mu(s_2) - \mu(s_1 \cap s_2)$ if $\mu$ is a probability measure 
over $S$. Hence, to account for this limitation of the access to knowledge (epistemic limitation), we 
can enrich the universe of discourse and consider the power set of $S$, i.e. the set of all subsets of $S$: 

$$2^S = \{A \mid A \subseteq S\} = \{\emptyset, s_1, s_2, \ldots, s_n, (s_1, s_2), \ldots, (s_{n-1}, s_n), \ldots, S\} \tag{16.2}$$

where $\emptyset$ denotes the empty set. This enrichment of the universe of discourse allows ignorance and 
uncertainty to be better represented, and supplementary types of conflict to be taken into 
account. While probability theory is based on the classical set notion, the notion of power set is the 
basis for Dempster-Shafer theory (see section 16.5.3 for a brief description), possibility theory and 
rough set theory. In this context, we can assign measures to every subset of $S$, independently 
of the others. Note finally that Dempster-Shafer theory is based on the assumption of a universe 
of discourse composed of an exhaustive list of mutually exclusive elements [33], a very restrictive 
constraint in practice. 


2. Another limitation is due to the fact that the observable world is more complex than we can describe. 
This ontological limitation is linked to the properties of the objects and has nothing to do with our 
means of perception. For example, $s_1 \cap s_2$ represents another object composed of both $s_1$ and $s_2$. It 
is neither $s_1$ nor $s_2$ but something between them. Hence, yet another extension is the construction 
of the hyper-power set, constituted of all the combinations of the union and intersection operators 
applied to the elements of $S$: 

$$D^S = \{\emptyset, s_1, \ldots, s_n, (s_1 \cup s_2), \ldots, S, (s_1 \cap s_2), \ldots, (s_1 \cap s_2) \cup s_3, \ldots\} \tag{16.3}$$

If the elements of $S$ are mutually exclusive ($s_i \cap s_j = \emptyset$ for all $i \neq j$), then $D^S = 2^S$. However, 
considering $D^S$ is a more general case allowing $s_i \cap s_j \neq \emptyset$, i.e. allowing objects of the universe 
of discourse to overlap. An example is a universe constituted of vague concepts. Extending the 
definition of the probability measure to the hyper-power set is the principle of Dezert-Smarandache 
theory [33]. In this framework, no initial assumption of mutual exclusivity on $S$ is imposed, 
and the exhaustivity is somewhat delayed since new objects can be constructed from those of $S$. A 
brief description of DSmT is proposed in section 16.5.3. 

² In equation (16.2), $(s_1, s_2)$ is used to denote $(s_1 \cup s_2)$. 
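
The two enrichments of the universe of discourse can be enumerated for small universes. The sketch below is only illustrative and rests on one assumption that is not in the chapter: each element of the hyper-power set is encoded as the set of Venn-diagram regions it covers, a region being labelled by the group of $s_i$ that overlap there. The function names `power_set` and `hyper_power_set` are hypothetical.

```python
from itertools import combinations

def power_set(universe):
    """All subsets of `universe` (the set 2^S), returned as frozensets."""
    items = list(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def hyper_power_set(n):
    """Elements of D^S for S = {s_1, ..., s_n}, each encoded as a set of Venn regions."""
    indices = range(1, n + 1)
    # A region is labelled by the non-empty group of indices i whose s_i overlap there.
    regions = [frozenset(c) for r in indices for c in combinations(indices, r)]
    # s_i is the set of all regions lying inside s_i.
    generators = [frozenset(reg for reg in regions if i in reg) for i in indices]
    elements = set(generators)
    frontier = list(generators)
    # Close the generators under union and intersection.
    while frontier:
        a = frontier.pop()
        for b in list(elements):
            for c in (a | b, a & b):
                if c not in elements:
                    elements.add(c)
                    frontier.append(c)
    elements.add(frozenset())  # the empty set also belongs to D^S
    return elements

print(len(power_set(["s1", "s2", "s3"])))  # 8
print(len(hyper_power_set(2)))             # 5
print(len(hyper_power_set(3)))             # 19
```

For two elements the five members of $D^S$ are $\emptyset$, $s_1$, $s_2$, $s_1 \cap s_2$ and $s_1 \cup s_2$. The size of $D^S$ grows much faster than that of $2^S$, which is the price paid for letting the elements overlap.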


Therefore, we can say that while the Dempster-Shafer theory of evidence is an epistemic theory, since it 
only represents epistemic limitations, Dezert-Smarandache theory is basically an epistemic and ontological 
theory, since this framework combines both epistemic and ontological viewpoints. 


16.4.4 Autoreference 


By autoreference we mean the capacity of an agent for introspection or self-reference. For example, an 
agent should be granted the capacity of holding beliefs about its own declarations, and not only 
about the declarations of the other agents. 


1. A classical means of modelling autoreference is by way of hypersets. The notion of hyperset 
was first introduced by Aczel [34] and Barwise and Etchemendy [35] to overcome Russell’s 
paradox³. A recursive definition extends the notion of classical set, allowing hypersets to contain 
themselves, leading to infinitely deep sets (for example, $x = 1 + 1/x$). A well-founded set is a set 
without an infinite descending membership sequence, whereas the others are called non-well-founded 
sets. 


2. In modal logics, Kripke structures are used as a semantics (see section 16.6.1). In a Kripke structure, 
an accessibility relation is defined over a set of possible worlds, which models either the structure of 
the world or the agent properties. The desired properties of an agent are then modeled by imposing 
some properties on the accessibility relation. In particular, if the relation is reflexive and transitive, 
then the agent possesses the capacity of positive introspection (the agent knows that it knows). Also, 
if the relation is an equivalence relation, the agent is capable of formulating declarations about its 
ignorance (negative introspection). 
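
The link between the properties of the accessibility relation and introspection can be checked on a toy structure. This is a minimal sketch under stated assumptions: a single agent, three hypothetical worlds `w1`, `w2`, `w3`, a relation built from a partition (hence an equivalence relation), and the standard reading that the agent knows a proposition at a world when it holds in every accessible world. None of the names come from the chapter.

```python
def equivalence_from_partition(classes):
    """Accessibility relation in which each world sees exactly its own class."""
    return {(u, v) for cls in classes for u in cls for v in cls}

def is_reflexive(worlds, rel):
    return all((w, w) in rel for w in worlds)

def is_symmetric(rel):
    return all((v, u) in rel for (u, v) in rel)

def is_transitive(rel):
    return all((a, d) in rel for (a, b) in rel for (c, d) in rel if b == c)

def knows(world, prop, rel):
    """The agent knows `prop` (a set of worlds) at `world` if it holds in all accessible worlds."""
    return all(v in prop for (u, v) in rel if u == world)

def known_worlds(prop, worlds, rel):
    """Set of worlds at which the agent knows `prop`."""
    return {w for w in worlds if knows(w, prop, rel)}

worlds = {"w1", "w2", "w3"}
rel = equivalence_from_partition([{"w1", "w2"}, {"w3"}])
p = {"w1", "w2"}   # p holds in w1 and w2
q = {"w1"}         # q holds in w1 only

# Positive introspection: wherever the agent knows p, it knows that it knows p.
k_p = known_worlds(p, worlds, rel)
print(all(knows(w, k_p, rel) for w in k_p))            # True

# Negative introspection: wherever the agent does not know q, it knows that it does not.
not_k_q = worlds - known_worlds(q, worlds, rel)
print(all(knows(w, not_k_q, rel) for w in not_k_q))    # True
```

At `w1` the agent cannot tell `w1` from `w2`, so it does not know q, and because the relation is an equivalence it is aware of that ignorance.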


Although these two models, hypersets and Kripke models, are presented here as distinct, both are 
semantics of (multi-agent) modal logics, and the equivalence of both semantics has been proven in [37] [38]. 
Indeed, with the notion of hyperset comes the graph metaphor, which replaces the “container” metaphor 
used in classical set theory (see figure 16.5). By definition, a graph $G$ is a pair $(S, R)$, where $S$ is a set of 
nodes and $R$ is a relation over $S$. A labeled graph is a triple $\mathcal{S} = (S, R, \pi) = (G, \pi)$, where $G$ is a graph 
and $\pi$ is a valuation function from $P$ to $2^S$, with $P$ being a set of propositional variables, that assigns 
to each $p$ of $P$ a subset of $S$. Now, a Kripke model can be viewed as a directed labeled graph⁴ whose 
nodes are the possible worlds, the links between nodes representing the accessibility relation, labeled by 
truth assignments. 

Figure 16.5: Representation of classical sets. Arrows in figure (b) mean that $s_i$ is a member of $S$. 

³ “Russell’s paradox is the most famous of the logical or set-theoretical paradoxes. The paradox arises within naive set 
theory by considering the set of all sets that are not members of themselves. Such a set appears to be a member of itself if 
and only if it is not a member of itself, hence the paradox” [38]. 

First introduced for modal logics and knowledge logics, the model proposed by Kripke appears as an 
elegant structure for reasoning about knowledge in a multi-agent context. Moreover, it is based on the 
notion of possible world, which is close to the intuitive notion of situation. Hence, we choose it as the 
basic structure for situation analysis. In section 16.6 we develop our argumentation to connect Kripke 
structures with neutrosophic frameworks. After a more formal description of Kripke structures (section 
16.6.1), we first extend this structure to neutrosophic logic (section 16.6.2). Then, considering mainly 
the notion of possible worlds, we extend probability structures to DSm structures (section 16.6.3). 
Finally, we make the connection between DSmT and neutrosophic logic through Kripke structures (section 
16.6.4). 


16.5 Neutrosophic frameworks for Situation Analysis 


16.5.1 Neutrosophy 


Neutrosophy is presented by F. Smarandache as “a new branch of philosophy, which studies the origin, 
nature, and scope of neutralities, as well as their interactions with different ideational spectra” [O]. It is 
formalized as follows: 


Let A be an idea, a proposition, a theory, an event, a concept, an entity. Then, using different 
unary operators, we define 

• A′, a version of A; 
• Anti-A, the opposite of A; 
• Non-A, what is not A; 
• Neut-A, what is neither A nor Anti-A. 

⁴ Although the demonstration proposed in [37] [38] is more complex (!), it relies on the previous remark. 


Neut-A represents a neutrality in between the two extremes, A and Anti-A. Hence, between A and 
Anti-A there is a continuum-power spectrum of neutralities Neut-A: A - Neut-A - Anti-A. Note that 
Non-A is different from Anti-A (Non-A $\neq$ Anti-A), but also that Anti-A $\subset$ Non-A, Neut-A $\subset$ Non-A, 
A $\cap$ Anti-A $= \emptyset$ and A $\cap$ Non-A $= \emptyset$. 


We give below an example for multi-agent situation analysis: 

Let us assume a system composed of $n$ agents $A_1, \ldots, A_n$. Let us call $KB_i$ the knowledge base 
of agent $i$, $i = 1, \ldots, n$. Then, 

• $KB_1$ is all the information agent $A_1$ has access to; 
• $KB_1'$ is another version of $KB_1$: for example, an update of $KB_1$, or $KB_1$ issued from a 
partition of the sources of information of $A_1$, hence another view of $KB_1$; 
• Anti-$KB_1$ is all the information agent $A_1$ has no access to (or the information it did not 
use for a given representation of the situation); 
• Non-$KB_1$ is all the information agents $A_2, \ldots, A_n$ have access to, but not shared with 
$A_1$, plus the information nobody has access to; 
• Neut-$KB_1$ is all the information agents $A_2, \ldots, A_n$ have access to, but not shared with 
$A_1$. 


The only formal approaches derived from neutrosophy that will be studied in this chapter are the 
neutrosophic logic introduced by Smarandache [10] [11] and the Dezert-Smarandache theory proposed by 
Dezert and Smarandache [33] [39]. In sections 16.5.2 and 16.5.3 we review the basics of these approaches. 


16.5.2 Neutrosophic logic 


Neutrosophic logic (NL) is a method for neutrosophic reasoning. This non-classical logic is a 
multiple-valued logic which generalizes, among others, fuzzy logic. It is the “(first) attempt to unify many 
logics in a single field” [10]. 

While in classical logic a concept (proposition) A is either TRUE or FALSE, and in fuzzy logic 
A is allowed to be more or less TRUE (and consequently more or less FALSE) using truth degrees, in 
neutrosophic logic a concept A is T% TRUE, I% INDETERMINATE and F% FALSE, where 
$(T, I, F) \in \|{}^{-}0, 1^{+}\|^{3}$. The interval $\|{}^{-}0, 1^{+}\|$ is a hyperreal interval⁵, the superscript in this notation 
referring to a three-dimensional space. As a general framework, neutrosophic logic corresponds to an 
extension in three distinct directions: 

1. With A are considered Non-A, Anti-A, Neut-A and A′; 

2. The semantics is based on three independent assignments, not a single one as is commonly 
the case in other logics; 

3. These three assignments take their values as subsets of the hyperreal interval $\|{}^{-}0, 1^{+}\|$, instead 
of in [0, 1]. 

A is thus characterized by a triplet of truth values, called the neutrosophical value: 

$$NL(A) = (T(A), I(A), F(A)) \tag{16.4}$$

where $(T(A), I(A), F(A)) \subset \|{}^{-}0, 1^{+}\|^{3}$. 

⁵ Hyperreals: Non-standard reals (hyperreals) were introduced in 1960. Let [0, 1] be the real standard interval, i.e. 
the set of real numbers between 0 and 1. An extension of this interval is to replace the lower and upper bounds by their 
non-standard counterparts ${}^{-}0$ and $1^{+}$, being respectively $0 - \varepsilon$ and $1 + \varepsilon$, where $\varepsilon > 0$ is an infinitesimal number (i.e. such 
that for all integers $n > 0$, $\varepsilon < 1/n$). 
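
The independence of the three components of the neutrosophical value of equation (16.4) can be illustrated with a deliberately simplified sketch. It assumes single standard real numbers in [0, 1] for T, I and F, whereas the text allows subsets of a hyperreal interval, so it is a restriction of the definition rather than an implementation of it. The class name `NeutrosophicValue` and the embedding `from_fuzzy` (indeterminacy 0, falsity equal to 1 minus truth) are illustrative choices.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class NeutrosophicValue:
    """Simplified neutrosophical value (T, I, F) with standard real components."""
    t: float  # degree of truth
    i: float  # degree of indeterminacy
    f: float  # degree of falsity

    def __post_init__(self):
        # Each component is bounded on its own; no constraint links t, i and f.
        for name in ("t", "i", "f"):
            value = getattr(self, name)
            if not 0.0 <= value <= 1.0:
                raise ValueError(f"{name} must lie in [0, 1], got {value}")

    @classmethod
    def from_fuzzy(cls, truth_degree):
        """Embed a fuzzy truth degree: falsity is tied to truth, indeterminacy is 0."""
        return cls(truth_degree, 0.0, 1.0 - truth_degree)

paradox = NeutrosophicValue(1.0, 1.0, 1.0)      # completely true, indeterminate and false
fuzzy_like = NeutrosophicValue.from_fuzzy(0.75)
print(paradox)
print(fuzzy_like)
```

Because no constraint ties the three components together, the paradoxical triple (1, 1, 1) is a legal value, while a fuzzy truth degree uses only one of the three degrees of freedom.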


16.5.3 Dezert-Smarandache theory (DSmT) 


Because the theory proposed by Dezert and Smarandache is presented as a generalization of Dempster- 
Shafer theory, the latter being itself interpreted as a generalization of probability theory, we briefly review 
the basics of these two theories before introducing DSmT. 


A probability space is a 3-tuple $\mathcal{P} = (S, \chi, \mu)$ where: 

• $S = \{s_1, s_2, \ldots, s_n\}$ is the sample space, the set of the elementary events, the set of all outcomes 
for a given experiment; 
• $\chi$ is a $\sigma$-algebra of $S$; 
• $\mu$ is a probability assignment from $\chi$ to [0, 1]. 

To each element $A$ of $\chi$ is assigned a non-negative real number $\mu(A)$, a probability measure of $A$ (or simply 
probability of $A$), that must satisfy the following axioms: (1) $\mu(A) \geq 0$; (2) $\mu(S) = 1$; 
(3) $\mu\left(\bigcup_{i=1}^{\infty} A_i\right) = \sum_{i=1}^{\infty} \mu(A_i)$ if $A_i \cap A_j = \emptyset$ for $A_i \neq A_j$. 

Axiom 3 is also known as the condition of $\sigma$-additivity, or simply the axiom of additivity, and plays a 
crucial role in the theory of probability. Indeed, it imposes a restriction on the measurable sets (i.e. the 
sets to which we are able to assign probability measures), since one direct consequence is $\mu(\bar{A}) = 1 - \mu(A)$, 
where $\bar{A} = S \setminus A$. In other words, $\mu(A)$ does not depend on any $\mu(B)$ such that $B \subset A$. 

The theory of evidence was originally developed by Dempster in 1967 in his work on upper and 
lower probabilities [7], and later on by Shafer in his famous book A Mathematical Theory of Evidence [3], 
published in 1976. Often interpreted as an extension of the Bayesian theory of probabilities, the theory of 
evidence offers the main advantage of better representing uncertainty because the measures are defined 
on the power set of the universe of discourse, instead of the universe itself as in probability theory. 
This particularity leads to the relaxation of the additivity axiom of probability theory into a less 
restrictive one, a super-additivity axiom. 

A belief function is defined from $2^S$ to [0, 1], satisfying the following axioms: (1) $Bel(\emptyset) = 0$; (2) $Bel(S) = 1$; (3) for every positive integer n, and for every collection $A_1, \ldots, A_n$ of subsets of S, $Bel(A_1 \cup \ldots \cup A_n) \geq \sum_{i} Bel(A_i) - \sum_{i<j} Bel(A_i \cap A_j) + \ldots + (-1)^{n+1} Bel(A_1 \cap \ldots \cap A_n)$. Contrary to the probability measure, the belief measure is non-additive and the axiom of additivity of probability theory is replaced by an axiom of superadditivity. The main consequence of this axiom is that every element of the power set of S is measurable. Hence, we can have $Bel(A) > Bel(B)$ if $B \subset A$.

A belief function is often defined using a basic probability assignment (or basic belief assignment) m from $2^S$ to [0, 1] that must satisfy the following conditions: (1) $m(\emptyset) = 0$ and (2) $\sum_{A \subseteq S} m(A) = 1$. Then we have $Bel(A) = \sum_{B \subseteq A, B \in 2^S} m(B)$.
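The relation between m and Bel can be sketched in a few lines of Python. This is only an illustration: the frame {s1, s2, s3} and the mass values are hypothetical and do not come from the chapter, and `belief` is an illustrative helper name.

```python
from fractions import Fraction

def belief(m, A):
    """Bel(A): sum of the masses m(B) of all focal sets B contained in A."""
    A = frozenset(A)
    return sum((mass for B, mass in m.items() if B <= A), Fraction(0))

# Hypothetical basic belief assignment on S = {s1, s2, s3}; m(empty set) = 0.
S = frozenset({"s1", "s2", "s3"})
m = {
    frozenset({"s1"}): Fraction(1, 2),
    frozenset({"s1", "s2"}): Fraction(1, 4),
    S: Fraction(1, 4),
}
assert sum(m.values()) == 1  # condition (2) on m

bel_s1 = belief(m, {"s1"})
bel_s2 = belief(m, {"s2"})
bel_s1_s2 = belief(m, {"s1", "s2"})

# Superadditivity on two disjoint sets: Bel(A U B) >= Bel(A) + Bel(B).
assert bel_s1_s2 >= bel_s1 + bel_s2
```

With these values, the mass placed on {s1, s2} and on S is what makes the belief in a union exceed the sum of the beliefs in its parts.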

Dezert-Smarandache theory (DSmT) is another extension in this direction since all the elements of the hyper-power set are measurable. A general basic belief mass is then defined from $D^S$ to [0, 1], satisfying the following conditions:

$m(\emptyset) = 0$ and $\sum_{A \in D^S} m(A) = 1$ (16.5)

Hence, for example, elements of the type $s_i \cap s_j$, $i \neq j$, are allowed to be measured. The general belief function is then defined by:

$Bel'(A) = \sum_{B \subseteq A, B \in D^S} m(B)$ (16.6)

We note $Bel'$ to distinguish it from the belief function in the Shafer sense, Bel.
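As a minimal sketch of (16.5) and (16.6), the hyper-power set of a two-element frame can be encoded through the regions of its Venn diagram, so that set inclusion does the work of the condition B ⊆ A. The region labels, the string keys and the mass values below are assumptions made for illustration only.

```python
from fractions import Fraction

# Venn regions for S = {s1, s2} (illustrative labels):
# "a" = s1 only, "b" = s2 only, "ab" = the overlap of s1 and s2.
s1 = frozenset({"a", "ab"})
s2 = frozenset({"b", "ab"})

# Non-empty elements of the hyper-power set D^S, built with intersection and union.
hyper_power_set = {
    "s1&s2": s1 & s2,
    "s1": s1,
    "s2": s2,
    "s1|s2": s1 | s2,
}

# Hypothetical general basic belief mass; it sums to 1 as required by (16.5).
m = {
    "s1&s2": Fraction(1, 5),
    "s1": Fraction(3, 10),
    "s2": Fraction(1, 10),
    "s1|s2": Fraction(2, 5),
}
assert sum(m.values()) == 1

def general_belief(name):
    """Bel'(A) as in (16.6): sum of m(B) over the elements B of D^S included in A."""
    A = hyper_power_set[name]
    return sum(
        (mass for B, mass in m.items() if hyper_power_set[B] <= A),
        Fraction(0),
    )
```

Here the mass given to the intersection of s1 and s2 contributes to the belief of both s1 and s2, which is not possible when masses are restricted to the power set.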
DSmT is thus a more general framework that deals with both ontological and epistemic uncertainty. However, like most quantitative approaches, it lacks a formal structure for reasoning. In the following section, we propose a way to add such semantics to DSmT.


16.6 Possible worlds semantics for neutrosophic frameworks 


The possible worlds semantics provides an intuitive means for reasoning about situations. It delivers a general approach to providing semantics to logical approaches, with applicability to neutrosophic logic (section 16.6.2). Moreover, possible worlds semantics is often borrowed from logical approaches to fill the lack of semantics of numerical approaches, as will be detailed below (section 16.6.3).
16.6.1 Kripke model

A Kripke model is a mathematical structure that can be viewed as a directed labeled graph. The graph’s nodes are the possible worlds s belonging to a set S of possible worlds, labeled by truth assignments $\pi$. More formally,

A Kripke model is a triple structure $S_K$ of the form $(S, R, \pi)$ where

• S is a non-empty set (the set of possible worlds);
• $R \subseteq S \times S$ is the accessibility relation;
• $\pi : (S \to P) \to \{0; 1\}$ is a truth assignment to the propositions per possible world,

where $P = \{p_1, \ldots, p_n\}$ is a set of propositional variables, and {0; 1} stands for {FALSE; TRUE}.


A world s is considered possible with respect to another world s' whenever there is an edge linking s and s'. This link is defined by an arbitrary binary relation, technically called the accessibility relation. Figure 16.6 illustrates the following example:

An agent is wondering if “it is raining in New York” ($\varphi$) and if “it is raining in Los Angeles” ($\psi$). Since this agent has no information at all about the situation, it will consider possible situations (worlds) $S = \{s_1, s_2, s_3, s_4\}$:

• A situation $s_1$ in which it is both raining in New York and in Los Angeles, i.e. $\pi(s_1)(\varphi)$ = TRUE and $\pi(s_1)(\psi)$ = TRUE.

• A situation $s_2$ in which it is raining in New York but not in Los Angeles, i.e. $\pi(s_2)(\varphi)$ = TRUE and $\pi(s_2)(\psi)$ = FALSE.

• A situation $s_3$ in which it is not raining in New York and raining in Los Angeles, i.e. $\pi(s_3)(\varphi)$ = FALSE and $\pi(s_3)(\psi)$ = TRUE.

• A situation $s_4$ in which it is neither raining in New York nor in Los Angeles, i.e. $\pi(s_4)(\varphi)$ = FALSE and $\pi(s_4)(\psi)$ = FALSE.
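The rain example can be written down as a small data structure. The sketch below is an illustration only: it assumes that an agent with no information considers every world accessible from every world (the exact edges of Figure 16.6 are not reproduced here), and the helper names `knows` and `considers_possible` are hypothetical.

```python
from itertools import product

WORLDS = ("s1", "s2", "s3", "s4")

# Truth assignment pi(s)(p): phi = rain in New York, psi = rain in Los Angeles.
PI = {
    "s1": {"phi": True, "psi": True},
    "s2": {"phi": True, "psi": False},
    "s3": {"phi": False, "psi": True},
    "s4": {"phi": False, "psi": False},
}

# Assumption: total accessibility relation (the agent cannot rule out any world).
R = set(product(WORLDS, repeat=2))

def accessible(s):
    """Worlds t such that (s, t) belongs to R."""
    return {t for (u, t) in R if u == s}

def knows(s, p):
    """p is known at s if it is TRUE in every world accessible from s."""
    return all(PI[t][p] for t in accessible(s))

def considers_possible(s, p):
    """p is considered possible at s if it is TRUE in some accessible world."""
    return any(PI[t][p] for t in accessible(s))
```

Under this assumption the agent knows neither proposition in any world, yet considers both possible, which matches the "no information at all" reading of the example.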


16.6.1.1 Modelling the structure of the world


A very interesting feature of Kripke model semantics is that it is possible to generate axioms for the different systems of modal logic by expressing conditions on the accessibility relation defined on $S_K$. These conditions can be used to express properties or limitations of agents (according to a given model of the world). For example, any epistemic system built upon a Kripke model satisfying a reflexive accessibility relation also satisfies the true knowledge axiom (T). If the model satisfies a reflexive and transitive accessibility relation, it also satisfies the axiom of positive introspection (4). Satisfaction of the axiom of negative introspection (5) is given by an equivalence relation (see Table 16.1). System K45 is obtained by making the accessibility relation transitive and Euclidean, whereas KD45, which is sometimes used to model evidential reasoning on Dempster-Shafer structures (see section 16.6.3.2), is obtained by making R transitive, Euclidean and serial. This is summarized in Table 16.1 and explained below.

Figure 16.6: Example of a set of possible worlds and their accessibility relations.


Table 16.1: Axioms, epistemic logic systems and accessibility relations between possible worlds.

Accessibility relation    Axiom
Reflexive                 (T) $K\varphi \to \varphi$ (True knowledge)
Reflexive + Transitive    (4) $K\varphi \to KK\varphi$ (Positive introspection)
Equivalence               (5) $\neg K\varphi \to K\neg K\varphi$ (Negative introspection)
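The conditions on R named above are easy to test on a finite relation. The following sketch uses the standard definitions of reflexive, symmetric, transitive, Euclidean and serial relations; the function names and the two small example relations are hypothetical.

```python
def is_reflexive(S, R):
    return all((s, s) in R for s in S)

def is_symmetric(R):
    return all((b, a) in R for (a, b) in R)

def is_transitive(R):
    # a R b and b R d imply a R d
    return all((a, d) in R for (a, b) in R for (c, d) in R if b == c)

def is_euclidean(R):
    # a R b and a R c imply b R c
    return all((b, c) in R for (a, b) in R for (a2, c) in R if a == a2)

def is_serial(S, R):
    # every world sees at least one world
    return all(any((s, t) in R for t in S) for s in S)

def is_equivalence(S, R):
    return is_reflexive(S, R) and is_symmetric(R) and is_transitive(R)

S = {1, 2, 3}

# Transitive, Euclidean and serial but not reflexive: a KD45-style frame.
R_kd45 = {(1, 2), (2, 2), (3, 2)}

# The identity relation is an equivalence relation.
R_identity = {(s, s) for s in S}
```

A frame such as `R_kd45` illustrates why KD45 is used for belief rather than knowledge: without reflexivity, axiom (T) is not guaranteed.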


16.6.1.2 Truth assignment 


As previously said, to each world $s \in S$, there is an associated truth assignment $\pi(s)$ defined from P to {0; 1} such that:

$\pi(s)(p) = 1$ if $s \models p$, and $\pi(s)(p) = 0$ if $s \not\models p$ (16.7)

where $p \in P$. $s \models p$ means that the world s entails the proposition p, or in other words, that p is TRUE in s.

The assignments $\pi(s)$ are expected to obey the classical definitions of the connectives so that, for example, $\pi(s)(\neg p) = S \setminus \pi(s)(p)$, $\pi(s)(p \wedge q) = \pi(s)(p) \cap \pi(s)(q)$, etc.

A formula is any composition of some elements of P with the basic connectives $\neg$ and $\wedge$. Let us call $\Phi$ the set of formulae and $\varphi$ an element of $\Phi$. For example, $\varphi_1 = p_1 \wedge \neg p_2$, $\varphi_2 = p_1$, $\varphi_3 = p_1 \wedge \ldots \wedge p_n$, $\varphi_i \in \Phi$, $i = 1, \ldots, n$. Hence, the truth assignments $\pi(s)$ are also defined for any formula of $\Phi$, $\pi(s)(\varphi)$ being equal to 1 if $\varphi$ is TRUE in s.
To each p of P, there is an associated truth set $A_p$ of all the elements of S for which $\pi(s)(p)$ is TRUE:

$A_p = \{s \in S \mid \pi(s)(p) = 1\}$ (16.8)

$A_p$ is then the set of possible worlds in which p is TRUE, and can also be noted $A_p = \{s \in S \mid s \models p\}$. By extension, to each formula $\varphi$ is associated a truth set, $A_\varphi$.

Note that the elements of P are not necessarily mutually exclusive. A way to obtain mutually exclusive elements is to build the set At, the set of basic elements, where a basic element⁶ is a formula of the (conjunctive) form $\delta = p'_1 \wedge \ldots \wedge p'_n$ with $p'_i$ being either $p_i$ or $\neg p_i$, $p_i \in P$. Any formula $\varphi \in \Phi$ can then be written in a disjunctive form as $\varphi = \delta_1 \vee \ldots \vee \delta_k$, with $\delta_i \in At$.

To each world s, there is an associated basic element $\delta$ of At, thus describing the truth values of the propositions of P in s. Whereas many worlds can be associated with the same basic element, a basic element may not be associated with any world (see the example of section 16.6.3.4). The basic elements are just an alternate way to specify the truth assignment $\pi$.
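Truth sets and basic elements can be computed directly from a truth assignment. The sketch below borrows the assignment of the example of section 16.6.3.4 (p1 = gray, p2 = double-breasted); the encoding of a basic element as a tuple of truth values and the helper names are assumptions made for illustration.

```python
from itertools import product

P = ("p1", "p2")

# pi(s)(p) for each world s, taken from the example of section 16.6.3.4.
pi = {
    "s1": {"p1": False, "p2": False},
    "s2": {"p1": False, "p2": False},
    "s3": {"p1": True, "p2": False},
    "s4": {"p1": True, "p2": True},
}

def truth_set(p):
    """A_p = {s in S | pi(s)(p) = 1}, as in (16.8)."""
    return {s for s, values in pi.items() if values[p]}

def atom_of(s):
    """Basic element of world s, encoded as the tuple of truth values of P."""
    return tuple(pi[s][p] for p in P)

# Every basic element (one per sign pattern) mapped to the worlds it describes.
worlds_by_atom = {atom: set() for atom in product((True, False), repeat=len(P))}
for s in pi:
    worlds_by_atom[atom_of(s)].add(s)
```

In this assignment one basic element describes two worlds and another describes none, which is the situation the paragraph above points out.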


16.6.1.3 Multi-agent context 


The definition of $S_K$ can easily be extended to the multi-agent case. Indeed, if we consider a set of agents $A_1, \ldots, A_n$, then on the same set of possible worlds S, and with the same truth assignment $\pi$, we can define n accessibility relations $R_i$, $i = 1, \ldots, n$, one per agent.

The different conditions on the $R_i$'s will then characterize the different properties of the $A_i$'s, facing the same situation.


16.6.2 Kripke structure for neutrosophic propositions 


We introduced in section 16.5.2 the basics of neutrosophic logic.

While in classical logic a formula $\varphi$ is simply characterized by its truth value $\pi(\varphi)$, being either 0 or 1 (FALSE or TRUE), in neutrosophic logic $\varphi$ is allowed to be T% TRUE, F% FALSE and I% INDETERMINATE. $\varphi$ is thus characterized by a triplet of truth-values, called the neutrosophical value:

$NL(\varphi) = (T(\varphi), I(\varphi), F(\varphi))$ (16.9)

where $(T(\varphi), I(\varphi), F(\varphi)) \subset \|{}^{-}0, 1^{+}\|^{3}$, $\|{}^{-}0, 1^{+}\|$ being an interval of hyperreals.

In the same way as in quantum logic, where Kripke structures are extended to deal with fuzzy propositions [41], we propose here to extend the Kripke structure to deal with neutrosophic assignments. Hence, we have,


A Kripke model for neutrosophic propositions is a triple structure $S_K^N$ of the form $(S, R, \pi)$ where

• S is a non-empty set (the set of possible worlds);

• $R \subseteq S \times S$ is the accessibility relation;

• $\pi = (\pi_T, \pi_I, \pi_F)$ is a neutrosophic assignment to the propositions per possible world, i.e. $\pi : (S \to P) \to \|{}^{-}0, 1^{+}\|$ with $\pi$ being either $\pi_T$, $\pi_I$ or $\pi_F$,

where $P = \{p_1, \ldots, p_n\}$ is a set of propositional variables.

⁶ A basic element is sometimes called an atom.


The “truth” assignment $\pi$ of a classical Kripke model then becomes $\pi = (\pi_T, \pi_I, \pi_F)$, a three-dimensional assignment, where $\pi_T$ is the truth assignment, $\pi_F$ is the falsity assignment and $\pi_I$ is the indeterminacy assignment. Hence, in each possible world s of S, a proposition $\varphi$ can be evaluated as $\pi_T(s)(\varphi)$ TRUE, $\pi_F(s)(\varphi)$ FALSE and $\pi_I(s)(\varphi)$ INDETERMINATE. It follows that to $\varphi$ is associated a truth-set $A_\varphi^T$, a falsity-set $A_\varphi^F$ and an indeterminacy-set $A_\varphi^I$:

$A_\varphi^T = \{s \in S \mid \pi_T(s)(\varphi) \neq 0\}$
$A_\varphi^F = \{s \in S \mid \pi_F(s)(\varphi) \neq 0\}$
$A_\varphi^I = \{s \in S \mid \pi_I(s)(\varphi) \neq 0\}$

Note that $A_\varphi^T$, $A_\varphi^F$ and $A_\varphi^I$ are (1) no longer related, (2) fuzzy sets and may overlap.
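The three sets associated with a formula can be illustrated as follows. This is a rough sketch under a strong simplifying assumption: ordinary floats in [0, 1] stand in for the hyperreal values of the chapter, and the three worlds and their degrees are invented for the example.

```python
# Degrees of truth, indeterminacy and falsity of one formula phi in each world.
# Assumption: plain floats replace hyperreal values; all numbers are hypothetical.
PI_T = {"s1": 0.8, "s2": 0.0, "s3": 0.5}
PI_I = {"s1": 0.3, "s2": 0.6, "s3": 0.0}
PI_F = {"s1": 0.0, "s2": 0.9, "s3": 0.5}

def support(assignment):
    """Worlds in which the given component of the assignment is non-zero."""
    return {s for s, value in assignment.items() if value != 0}

truth_set = support(PI_T)
indeterminacy_set = support(PI_I)
falsity_set = support(PI_F)

# The three components are assigned independently, so the sets may overlap.
overlap_truth_falsity = truth_set & falsity_set
```

The membership degrees themselves are kept in the three dictionaries, which is what makes the sets fuzzy rather than crisp.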


16.6.2.1 Knowledge and belief 

Halpern in [42] gives the following definitions for knowledge and belief in PWS:

• $\varphi$ is known if it is TRUE in all the possible worlds s of S
• $\varphi$ is believed if it is TRUE in at least one possible world s of S

On the other hand, Smarandache [10] uses the notion of world and states that $T(\varphi) = 1^{+}$ if $\varphi$ is TRUE in all the possible worlds s of S (absolute truth) and $T(\varphi) = 1$ if $\varphi$ is TRUE in at least one possible world s of S (relative truth) (see Table 16.2). Hence, in the neutrosophical framework, we can state the following definitions for knowledge and belief: $\varphi$ is known if $T(\varphi) = 1^{+}$ and $F(\varphi) = {}^{-}0$, and $\varphi$ is believed if $T(\varphi) = 1$ and $F(\varphi) = 0$. Table 16.2 shows several special cases.

Furthermore, one can consider the unary operators of neutrosophic logic (Non-$\varphi$, Anti-$\varphi$, Neut-$\varphi$, $\varphi$’) to model new epistemic concepts but also as a means to represent situational objects, such as neutral situation, environment (to be detailed in the final version).


16.6.3 Probability assignments and structures 


Let S be the frame of discernment, s a singleton of S and A any subset of S. In probability theory, measurable objects are singletons s of S. The measures assigned to any subsets A of S are guided by the additivity axiom. Hence, measurable elements belong to a $\sigma$-algebra $\chi$ of $2^S$. In Dempster-Shafer theory, any element of the power set of S, $2^S$, is measurable. Finally, Dezert-Smarandache theory allows any element of the hyper-power set of S, $D^S$, to be measured. Apart from these extensions to probability theory that rely on the definition set of the probability measure, there exists a clear interest in giving better semantics to these numerical approaches. For his probabilistic logic, Nilsson uses the possible worlds semantics to build a “semantical generalization of logic”, combining logic with probability theory (see section 16.6.3.1). Later on, Fagin and Halpern [4] and also Bundy [43] extended Nilsson’s structure for probabilities, allowing all elements of the power set to be measurable and leading, just as Dempster-Shafer theory generalizes probability theory, to a general structure, the Dempster-Shafer structure (see section 16.6.3.2).

Table 16.2: Neutrosophical values for special cases (adapted from [10]).

$\varphi$ is ...   in ... poss. world(s)   Neutrosophical value
true               all                     $T(\varphi) = 1^{+}$, $F(\varphi) = {}^{-}0$
false              all                     $F(\varphi) = 1^{+}$, $T(\varphi) = {}^{-}0$
indet.             all                     $I(\varphi) = 1^{+}$
true               at least one            $T(\varphi) = 1$, $F(\varphi) = 0$
false              at least one            $F(\varphi) = 1$, $T(\varphi) = 0$
indet.             at least one            $I(\varphi) = 1$
indet.

In the following, after a brief review of Nilsson and Dempster-Shafer structures, we extend the latter and propose a Dezert-Smarandache structure (section 16.6.3.3), combining the DSmT framework and the possible worlds semantics. To end this part, we propose in section 16.6.3.4 an example of the potential interest of such a structure.


16.6.3.1 Nilsson structure 


A Nilsson structure is a tuple $S_N = (S, \chi, \mu, \pi)$ where

• $S = \{s_1, s_2, s_3, \ldots\}$, the set of all possible worlds;
• $\chi$, a $\sigma$-algebra of subsets of S;
• $\mu$, a probability measure defined on $\chi$;
• $\pi : (S \to P) \to \{0; 1\}$, a truth assignment to the propositions per possible world,

with P being a set of propositional variables.

Note that $(S, \chi, \mu)$ is a probability space, and a Nilsson structure is also called a probabilistic structure.


In this kind of structure, the only measurable elements are those of $\chi$. However, if we are interested in any other formula of $\Phi$, the best thing we can do is to compute the inner and outer measures [4] defined respectively by

$\mu_*(A) = \sup\{\mu(B) \mid B \subseteq A, B \in \chi\}$ and $\mu^*(A) = \inf\{\mu(B) \mid B \supseteq A, B \in \chi\}$

The unknown value $\mu(A_\varphi)$ is replaced by the interval:

$\mu_*(A_\varphi) \leq \mu(A_\varphi) \leq \mu^*(A_\varphi)$ (16.10)

Hence, instead of a single probability measure $\mu$ from $\chi$ to [0, 1], we can compute a pair of probability measures $\mu_*$ and $\mu^*$.

In a Nilsson structure, the fact that $\mu$ is defined on $\chi$ (the set of measurable subsets) means that $\chi_\pi$ (the image of $\chi$ by $\pi$) is a sub-algebra of $\chi$, to ensure that $\mu(\varphi) = \mu(A_\varphi)$ for all $\varphi \in \Phi$. Dropping this condition is a means to extend $\mu$ to $2^S$ (hence Nilsson structure) and leads to Dempster-Shafer structure as formalized in and detailed below. The probability measure $\mu$ is then replaced by its inner measure $\mu_*$.


16.6.3.2 Dempster-Shafer structure 


Nilsson structure can be extended using the inner measure, i.e. allowing all the elements of $2^S$ to be measurable. Because the inner measure turns out to be the belief measure introduced by Shafer in his theory of evidence [8], the resulting structure is called a Dempster-Shafer structure. Note that $\chi$ and $\pi$ are no longer required to be related in any sense.

A Dempster-Shafer structure [4] is a tuple $S_{DS} = (S, 2^S, Bel, \pi)$ in which

• $S = \{s_1, s_2, s_3, \ldots\}$, the set of all possible worlds;
• $2^S$, the power set of S;
• Bel, a belief measure on $2^S$;
• $\pi : (S \to P) \to \{0; 1\}$, a truth assignment to the propositions per possible world,

with P being a set of propositional variables.

Note that we can simply write $S_{DS} = (S, Bel, \pi)$, where Bel is a belief function $Bel : 2^S \to [0, 1]$, in the Shafer sense (see section 16.5.3).

A Nilsson structure is then a special case of Dempster-Shafer structures, in which

$\mu_*(A_\varphi) = \mu^*(A_\varphi) = \mu(A_\varphi)$ (16.11)

for any $\varphi \in \Phi$.


⁷ Another way is to consider a partial mapping $\pi$, leading to Bundy’s structure of incidence calculus [23].

16.6.3.3 Dezert-Smarandache structure


In [83], the authors propose a generalization of Dempster-Shafer theory, defining a belief function on the hyper-power set instead of the power set as Shafer does. This theory is called Dezert-Smarandache theory, or simply DSmT. In a manner equivalent to the extension of Nilsson’s structure to the DS structure, the definition of $\mu$ can be extended to $D^S$, allowing all elements of the hyper-power set to be measurable. We then obtain what we can call a Dezert-Smarandache structure (DSm structure), an extension of the DS structure in the same way as DSmT is an extension of Dempster-Shafer theory.

A Dezert-Smarandache structure is a tuple $S_{DSm} = (S, D^S, Bel', \pi)$ where

• $S = \{s_1, s_2, s_3, \ldots\}$, the set of all possible worlds;
• $D^S$, the hyper-power set of S;
• $Bel'$, a general belief measure on $D^S$;
• $\pi : (S \to P) \to \{0; 1\}$, a truth assignment to the propositions per possible world,

with P being a set of propositional variables.

Note that we can simply write $S_{DSm} = (S, Bel', \pi)$, where $Bel'$ is the generalized belief function defined on $D^S$, as defined by Dezert and Smarandache (see section 16.5.3).


16.6.3.4 Example: Ron’s suits

This example is proposed in [4] as Example 2.4:

“Ron has two blue suits and two gray suits. He has a very simple method for deciding what color suit to wear on any particular day: he simply tosses a (fair) coin. If it lands heads, he wears a blue suit and if it lands tails, he wears a gray suit. Once he’s decided what color suit to wear, he just chooses the rightmost suit of that color on the rack. Both of Ron’s blue suits are single-breasted, while one of Ron’s gray suits is single-breasted and the other is double-breasted. Ron’s wife, Susan, is (fortunately for Ron) a little more fashion-conscious than he is. She also knows how Ron makes his sartorial choices. So, from time to time, she makes sure that the gray suit she considers preferable is to the right (which depends on current fashions and perhaps on other whims of Susan). Suppose we don’t know about the current fashion (or about Susan’s current whims). What can we say about the probability of Ron’s wearing a single-breasted suit on Monday?”


Let P be a set of primitive propositions, $P = \{p_1, p_2\}$. Let $p_1$ = “The suit is gray” and let $p_2$ = “The suit is double-breasted”. Then At, the corresponding set of basic elements, is:

$At = \{p_1 \wedge p_2, \; p_1 \wedge \neg p_2, \; \neg p_1 \wedge p_2, \; \neg p_1 \wedge \neg p_2\}$

At is thus a set of mutually exclusive hypotheses: “Ron chooses a gray double-breasted suit”, ..., “Ron chooses a blue single-breasted suit”.

S is the set of possible states of the world, i.e. the set of possible worlds, where a state corresponds in this example to a selection of a particular suit by Ron. To fix the ideas, let us number the suits from 1 to 4. Hence, $S = \{s_1, s_2, s_3, s_4\}$, $s_i$ being the world in which Ron chooses the suit i. Table 16.3 lists the possible worlds and their associated meaning and atom⁸. Table 16.4 gives some sets of worlds of interest and their associated formula. An alternative way to describe the state of a world (i.e. the truth values of each proposition in P) is to use $\pi$, a truth assignment defined from P to $2^S$. For each s in S, we have a truth assignment $\pi(s)$ defined from P to {0; 1}, such that $\pi(s)(p) = 0$ if p is false in s, and $\pi(s)(p) = 1$ if p is true in s.

Table 16.3: The 4 states of the world and their associated basic element.

World    Meaning                            Basic element
$s_1$    Blue single-breasted suit nb 1     $\neg p_1 \wedge \neg p_2$
$s_2$    Blue single-breasted suit nb 2     $\neg p_1 \wedge \neg p_2$
$s_3$    Gray single-breasted suit          $p_1 \wedge \neg p_2$
$s_4$    Gray double-breasted suit          $p_1 \wedge p_2$


Table 16.4: Some subsets of possible worlds of interest and their associated formula.

World(s)                 Meaning                   Formula
$\{s_1, s_2\}$           A blue suit               $\neg p_1$
$\{s_3, s_4\}$           A gray suit               $p_1$
$\{s_1, s_2, s_3\}$      A single-breasted suit    $\neg p_2$


Here, we have only 4 measurable events: $\mu(\{s_1, s_2\}) = \mu(\{s_3, s_4\}) = \frac{1}{2}$, $\mu(\emptyset) = 0$ and $\mu(S) = 1$. The question of interest here (What is the probability of Ron’s wearing a single-breasted suit?) concerns another, non-measurable event, i.e. $\{s_1, s_2, s_3\}$. In [4], the authors gave this example to illustrate the utility of attributing values to non-measurable events, and then introduced Dempster-Shafer structures. Their conclusion for this example is that the best we can say is that $\frac{1}{2} \leq \mu(\{s_1, s_2, s_3\}) \leq 1$, based on the inner and outer measures.
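The bounds quoted above can be checked numerically. The sketch below encodes the four measurable events of the example and evaluates the inner and outer measures of section 16.6.3.1; on a finite family, sup and inf reduce to max and min. The function names are illustrative.

```python
from fractions import Fraction

S = frozenset({"s1", "s2", "s3", "s4"})

# The only measurable events of the example and their probabilities.
mu = {
    frozenset(): Fraction(0),
    frozenset({"s1", "s2"}): Fraction(1, 2),
    frozenset({"s3", "s4"}): Fraction(1, 2),
    S: Fraction(1),
}

def inner_measure(A):
    """Largest probability of a measurable event contained in A."""
    A = frozenset(A)
    return max(p for B, p in mu.items() if B <= A)

def outer_measure(A):
    """Smallest probability of a measurable event containing A."""
    A = frozenset(A)
    return min(p for B, p in mu.items() if B >= A)

single_breasted = {"s1", "s2", "s3"}
lower = inner_measure(single_breasted)
upper = outer_measure(single_breasted)
```

For a measurable event such as {s1, s2}, both functions return the same value, which is the situation described by (16.11).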

Modelling the problem with 4 states means that, given our prior knowledge, these states correspond to the only possible situations after Ron’s selection: he will select one and only one suit among the 4 available. However, suppose that the two parts of the suits may have been mixed, so that we have two pieces (trousers and jacket) on the same coat-hanger. The 4 possible worlds then correspond to the 4 coat-hangers, and no longer to the 4 distinct suits. Imagining that the trousers are inside the jacket, Ron will select his suit only on the basis of the color of the jacket. Suppose, for example, that the coat-hanger he selects supports a blue jacket and gray trousers. Then, what is the corresponding state of the world? Clearly, this situation has not been considered in the modelling of the problem based on a DS structure. However, using a DSm structure allows the elements of the hyper-power set of S to be measurable. Hence, the state resulting from the selection of a mixed suit corresponds to $s_i \cap s_j$, with $i \neq j$. This means that we are in both worlds $s_i$ and $s_j$, and that with a single selection, Ron in fact selected two suits. Thus, we allow events other than those anticipated to occur.

⁸ Note that the basic element $\neg p_1 \wedge p_2$ is not associated with any state, while $\neg p_1 \wedge \neg p_2$ is associated with two states, $s_1$ and $s_2$.

One benefit of the resulting structure for situation analysis is that it provides an interesting framework for dealing with both vagueness and conflict, combining the logical, semantical and reasoning aspects through the possible worlds semantics, and the measuring and combination aspects through DSmT.


16.6.4 Connection between DSmT and neutrosophic logic in Kripke structures


Here we describe informally a possible connection between Dezert-Smarandache theory and neutrosophic logic.

Let $S_{DSm} = (S, Bel', \pi)$ be a DSm structure, and let $S_K^N = (S, R, \pi)$ be the corresponding Kripke structure for neutrosophic propositions. Hence, we define a general neutrosophic structure to be $S_N = (S, Bel', R, \pi)$, where:

• $S = \{s_1, s_2, s_3, \ldots\}$, the set of all possible worlds;
• $Bel'$, a general belief measure on $D^S$, the hyper-power set of S;
• $R \subseteq S \times S$ is the accessibility relation;
• $\pi = (\pi_T, \pi_I, \pi_F)$ is a neutrosophic assignment to the propositions per possible world, i.e. $\pi : (S \to P) \to \|{}^{-}0, 1^{+}\|$ with $\pi$ being either $\pi_T$, $\pi_I$ or $\pi_F$,

where $P = \{p_1, \ldots, p_n\}$ is a set of propositional variables.


In order to reason on this structure, we need a set of axioms (as is done, for example, in [4] for belief and probability) characterizing valid formulae. This can be achieved by imposing conditions on the accessibility relation R, conditions hopefully yielding agents that behave neutrosophically.

Hence, the aim of this general structure is to reconcile (1) DSmT as a tool for modelling both epistemic and ontological uncertainty, (2) possible worlds for the representation of situations, (3) neutrosophic logic as a general logical approach to deal independently with truth, falsity and indeterminacy, and (4) Kripke structures as a support for reasoning and modelling the properties of a collection of interacting agents.

We finally note that, although a connection can be found or stated, there is a priori no trivial link between the neutrosophic assignments $(\pi_T(s)(\varphi), \pi_I(s)(\varphi), \pi_F(s)(\varphi))$ that quantify truth, indeterminacy and falsity of formulae, and the belief granted to the corresponding sets of possible worlds through the general belief function proposed in DSmT, $Bel'$.

⁹ We consider here the monoagent case, although the extension to the multiagent case is trivial.


16.7 Conclusion 


In this chapter, we discussed neutrosophy and its capacity to tackle the challenges of situation analysis. In particular, we underlined and connected to neutrosophy four basic ontological principles guiding modelling in Situation Analysis: (1) allowing statements about uncertainty to be made, (2) contextualization, (3) enrichment of the universe of discourse, (4) allowing autoreference. The advantages of DSmT and neutrosophic logic were studied with these principles in mind. In particular, we highlighted the capacity of neutrosophic logic to reconcile both qualitative and quantitative aspects of uncertainty. Distinguishing ontological from epistemic obstacles in SA, we further showed that, being based on the power set, Dempster-Shafer theory appears in fact as an epistemic theory, whereas Dezert-Smarandache theory, based on the richer hyper-power set, appears capable of dealing with both epistemic and ontological aspects of SA. Putting forward the connection between hypersets and Kripke structures as a means to model autoreference, we then focused on Kripke structures as an appropriate device for reasoning in SA. In particular, we showed that it is feasible to build a DSm structure upon the possible worlds semantics, an extension of the classical probabilistic and Dempster-Shafer structures. Considering neutrosophic logic, we showed that it is possible to extend Kripke structures in order to take into account neutrosophic propositions, i.e. triplets of assignments on intervals of hyperreal numbers. We also showed how to represent the concepts of belief and knowledge with hyperreal truth (resp. falsity, indeterminacy) assignments on possible worlds. This allows one to introduce a clear qualitative distinction between certain belief and knowledge, a distinction that is not clear in traditional epistemic logic frameworks. Finally, we proposed a connection between neutrosophic logic and DSmT in the Kripke semantics setting.
Abstract: This chapter presents an environmental application of DSmT to land cover prediction. The spatial prediction of land cover at the field scale in winter is useful for reducing bare soils in intensive agricultural regions. The fusion process with the Dempster-Shafer theory (DST) proved to have limitations as the conflict increases between the sources of evidence that support land cover hypotheses. Several modifications may be used, such as source weighting or hedging methods, but with no benefit in the case study considered, since the conflict may not explain by itself all the bad decisions. Actually, the sources of evidence may together induce a wrong decision. It is then necessary to introduce paradoxical information. Nevertheless, the sources of evidence in use are defined according to the hypotheses “covered soil” or “bare soil” in the frame of DST. We investigate several points of view to define the belief assignments of the hyper-power set of DSmT from the initial power set of DST. In this way, smart belief assignments induce a better prediction of bare soils.


Samuel Corgne is also affiliated with TAMCIC, CNRS FRE 2658, team TIME, GET/ENST Bretagne, France. 
17.1 Introduction

In intensive agricultural areas, water quality may be improved by reducing bare soil surfaces during winter months. In this context, the knowledge of the spatio-temporal variations of land use and cover, as well as the spatial prediction of land cover at the field scale, appear essential for the issue of bare soil reduction. Land-cover prediction, which is useful for stakeholders who manage water-quality programs by focusing on the areas where the probability of finding bare soil is high, requires the identification and characterization of the driving factors of observed land-cover changes. The high variability of the driving factors that motivate land-cover changes between two successive winters requires integrating uncertainty into the modelling of the prediction process.


Several short-term predictions have been simulated with the Dempster-Shafer (DS) theory in previous studies to assess land-cover distribution in winter on a relatively intensive farming watershed of 61.5 km² [1]. This study area, located in western France, produces significant amounts of nitrogen before winter infiltration of water. The fusion process with the DS theory proved to have limitations as the conflict increases between the sources of evidence that support land cover hypotheses. Several modifications may be used (such as source weighting or hedging methods), but with no benefit in our application. It appears that conflict may not explain by itself all the bad decisions. Actually, the sources of evidence may together induce a wrong decision. Paradoxical information was therefore introduced to improve the prediction accuracy.


A first application of the Dezert-Smarandache theory on the study area yielded results slightly better than those of the DS theory, but the rate of correct prediction for the hypothesis “bare soil” was still below 40%. The fusion process must therefore be improved, especially for this hypothesis. In this application, the sources of evidence in use are still defined according to the hypotheses “Covered soil” or “Bare soil” in the frame of the Dempster-Shafer theory. Mass function assignments determined from statistical analysis and expert knowledge are defined to support the hypotheses, but the high level of conflict between sources requires a finer mass attribution and a “contextual” fusion process to manage the uncertainty and the paradoxical information.


This chapter focuses on the application of the Dezert-Smarandache theory to land-cover prediction in winter, and more precisely on the transfer from evidence to plausible and paradoxical reasoning. Our objective is to improve the land-cover prediction scores by investigating several points of view to define the belief assignments of the hyper-power set of the Dezert-Smarandache theory from the initial power set of the Dempster-Shafer theory. A first part concerns the identification and hierarchization of the driving factors of land cover changes on the studied watershed, and their transformation into pieces of evidence for the selected working hypotheses. The second part presents the process of land cover modelling with the Dezert-Smarandache theory, compared with the Dempster-Shafer theory, and its adaptation to this specific environmental study.


17.2 Determination of information sources 


The land cover in winter has been classified from remote sensing images into two land cover categories, “Bare soil” and “Covered soil”, which correspond to the two working hypotheses. Determining the information sources for each hypothesis of the fusion process consists in identifying and hierarchizing the factors that motivate the land cover changes between winters over the studied period (1996-2003).


17.2.1 Identification of the driving factors of land cover change 


The land-cover changes between winters in intensive agricultural regions are characterized by a high spatio-temporal variability depending on factors of several origins (economic, social, political, physical constraints) that need to be carefully defined in the modelling process. The identification of the driving factors of land-cover changes requires studying the land use over a fairly long period. A set of 10 satellite images (9 SPOT images and 1 IRS-LISS III, 2 per year over 5 years since 1996) has been acquired, pre-processed and classified. Winter land cover change trajectories were produced by successively merging all classifications [2]. All these data have been integrated in a GIS (Geographic Information System) to identify the crop successions spatially and the land-cover changes between winters at the field scale. A statistical analysis and a meeting with agricultural experts provided four main driving factors of land-cover changes, namely the field size, the crop successions, the agro-environmental actions and the distance of the fields from farm buildings. All these factors explain the winter land-cover distribution in the categories “Bare soil” or “Covered soil”. Then, a hierarchization of the identified driving factors of land-cover change was needed in the fusion process to predict the future land cover (mass belief assignment to the sources of evidence), in order to assess the respective “weight” of each explicative factor.


17.2.2 Hierarchization of the factors of land cover change 


The mutual information between the variables has been used to hierarchize the explicative factors of land-cover change. Mutual information analysis is based on information theory [3] and is used to outline relations between variables [4]. For this study, three indicators have been chosen to characterize the relationship between the variables that may explain the land cover evolution between winters.


• Entropy H: the main property of the information concept is that the quantity of information is maximum when the events are distributed uniformly. It allows the quantity of information in the set of events to be calculated:

$$H = -\sum_{i=1}^{N} p_i \log p_i,$$

with $N$ the number of possible events and $p_i$ the probability of event $i$.

• Mutual information I: it represents the mutual information between two variables X and Y; it is obtained from the entropies H(X) and H(Y) and the joint entropy H(X, Y) as follows:

$$I(X, Y) = H(X) + H(Y) - H(X, Y).$$

• Redundancy R: it is derived from the entropy and the mutual information. It measures the heterogeneity rate of two variables X, Y:

$$R = \frac{I(X, Y)}{H(Y)}.$$
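The three indicators can be computed directly from two columns of categorical observations. The following is a minimal sketch, assuming natural logarithms and empirical frequencies as probabilities; the function names and the toy data are illustrative only and are not taken from the study.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy H = -sum p_i log p_i of a sequence of categorical labels."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())

def mutual_information(x, y):
    """I(X, Y) = H(X) + H(Y) - H(X, Y); the joint entropy is taken over pairs."""
    return entropy(x) + entropy(y) - entropy(list(zip(x, y)))

def redundancy(x, y):
    """R = I(X, Y) / H(Y), with Y the variable to explain."""
    return mutual_information(x, y) / entropy(y)

# Hypothetical toy data: one explicative variable and the observed winter cover.
field_size = ["small", "small", "large", "large", "large", "small"]
land_cover = ["covered", "covered", "bare", "bare", "covered", "covered"]

print("H(cover) =", entropy(land_cover))
print("I(size, cover) =", mutual_information(field_size, land_cover))
print("R =", redundancy(field_size, land_cover))
```

Ranking the explicative variables by I (or R) against the land cover column gives the kind of hierarchization described here.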


The process ranks the explicative variables by the quantity of information they share with the variable to explain. The results of the mutual information test (Table 17.1) show that the most representative variable is “Crop successions (1996-2002)”, followed by “Size of the fields”, “Agro-environmental actions” and “Distance from farm buildings”, in decreasing order of representativeness. These results make it possible to optimise the mass belief assignment for the hypotheses “Bare soil” and “Covered soil”, in comparison with an empirical “expert knowledge” method.


Explicative variable            Np (%)

Distance from farm buildings    1255 (67.6 %)    0.14 %
                                601 (32.4 %)

Agro-environmental actions      1619 (87.2 %)    …
                                237 (12.8 %)

Field size                      …                0.97 %

Crop successions (1996-2002)    …                …


Table 17.1: Explicative variables hierarchization with the mutual information analysis.

Column Np (%) of Table 17.1 indicates the number Np of fields (and their percentage). Column 5 of the table indicates the values of the redundancy R and column 6 the values of the mutual information I. In the last row of Table 17.1 (i.e. crop rotations during 1996-2000), six cases have been identified; they correspond to

1. (SC W) : soils covered during all winters
2. (BS 1W) : bare soil during one winter
3. (BS 2W) : bare soil during two winters
4. (BS 3W) : bare soil during three winters
5. (BS 4W) : bare soil during four winters
6. (BS 5W) : bare soil during five winters


17.3 Land cover prediction with the Dempster-Shafer Theory 


The theory of evidence proposed by Dempster was developed by Shafer in 1976, and the basic concepts of this theory have often been presented [5] [6]. Detailed applications of the Dempster-Shafer theory can be found in [7]. Previous applications of the DS theory to our study [1] showed that 45% of the information sources were highly conflicting and generated mispredictions. Performance decreases when the conflict between the pieces of evidence rises (k > 0.6): in our case, only 75% of the fields affected by a high degree of conflict are correctly predicted. On the contrary, results become clearly better (91% of correct predictions) when the conflict is low (k < 0.2).


Several methods that attempt to make the fusion operators more reliable by considering the different sources of conflict may be found in [8] [9] [10] [11]. No optimal technique exists yet, even if an approximate adjustment of the fusion threshold can be successful for some applications. In order to deal with the conflict between the information sources, we have applied here a method based on source weakness.


17.3.1 Basic belief assignment 


The basic beliefs (membership function shapes) on the selected indicators are assigned by experts and from the evidence image distribution (Fig. 17.1). They are adjusted and validated with past observed data and expert knowledge. Table 17.2 illustrates this stage, which includes the uncertainty through the mass function assignment. For each piece of evidence, classes are defined in order to support one of the hypotheses B, C or B ∪ C, where B denotes “bare soil”, C “covered soil” and B ∪ C “bare soil or covered soil”.

Figure 17.1: Evidence image distribution for each hypothesis.

Table 17.2: Affectation of the belief masses for the DS theory.

17.3.2 Conflict managing with the source weakness
17.3.2.1 Principle 


The source weakness method (i.e. the discounting technique presented earlier in this book) takes the reliability of the evidence into account by assigning each source a reliability factor α with 0 ≤ α ≤ 1. A source is thus considered totally reliable if α = 1 and, on the contrary, completely unreliable if α = 0. The damping rule is defined as follows:

The weakening process is performed when the conflict is too high (relative to a threshold, such as k > 0.4).

Two rules have been investigated:

• α is set to a value such that the source does not interfere in the decision process. Then,

$$\begin{aligned}
m'(\theta_{\text{bare soil}}) &= 0.01 \\
m'(\theta_{\text{covered soil}}) &= 0.01 \\
m'(\theta_{\text{bare soil}} \cup \theta_{\text{covered soil}}) &= 0.98.
\end{aligned}$$

• α is set to a value linked to the conflict level k, so that the higher the conflict, the stronger the weakening. We recall that the conflict between two sources is defined as:

$$k = \sum_{A \cap B = \emptyset} m_1(A)\, m_2(B).$$
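A minimal sketch of the weakening step on the two-hypothesis frame follows. It assumes the classical discounting form from the Dempster-Shafer literature (committed masses scaled by α, the remainder moved to B ∪ C), since the chapter's own damping equation is not shown here. The choice α = 1 − k for the second rule and the example masses are illustrative assumptions, not the authors' settings.

```python
B, C, BC = "bare", "covered", "bare_or_covered"

def conflict(m1, m2):
    """k = sum of m1(X) * m2(Y) over pairs with empty intersection.
    On this frame only (B, C) and (C, B) are disjoint."""
    return m1[B] * m2[C] + m1[C] * m2[B]

def discount(m, alpha):
    """Classical discounting (assumed form): scale B and C by alpha,
    transfer the removed mass to the ignorance B u C."""
    weakened = {B: alpha * m[B], C: alpha * m[C]}
    weakened[BC] = 1.0 - weakened[B] - weakened[C]
    return weakened

def dempster(m1, m2):
    """Dempster's rule on {B, C, B u C}, normalised by 1 - k."""
    k = conflict(m1, m2)
    if k >= 1.0:
        raise ValueError("sources are in total conflict")
    fused = {
        B: m1[B] * m2[B] + m1[B] * m2[BC] + m1[BC] * m2[B],
        C: m1[C] * m2[C] + m1[C] * m2[BC] + m1[BC] * m2[C],
        BC: m1[BC] * m2[BC],
    }
    return {h: v / (1.0 - k) for h, v in fused.items()}

# Hypothetical masses for two conflicting sources.
source_1 = {B: 0.7, C: 0.2, BC: 0.1}
source_2 = {B: 0.1, C: 0.8, BC: 0.1}

k = conflict(source_1, source_2)
if k > 0.4:  # weaken only when the conflict exceeds the threshold
    source_2 = discount(source_2, alpha=1.0 - k)  # assumed link between alpha and k
print(k, dempster(source_1, source_2))
```

Which source to weaken, and by how much, is a modelling decision; the sketch weakens the second source only to keep the example short.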
17.3.2.2 Results and partial conclusion 


The results obtained with this method are slightly better than those of the simple application of the DS theory for the hypothesis “bare soil”, since 84 fields are correctly predicted against 73 for the DS theory. However, the analysis of the results showed that conflict is not necessarily involved in the mispredictions for the “bare soil” hypothesis. In addition, the plausibility-belief interval does not help to improve the accuracy of the predictions. An ambiguity between the sources must therefore be taken into consideration in the process. That is why the prediction process has been moved to the DSm theory, in order to deal with paradoxical information.


17.4 Land cover prediction with DSmT 


The Dezert-Smarandache theory (DSmT) can be considered as a generalization of the Dempster-Shafer theory. In this new theory, the rule of combination takes into account both uncertain and paradoxical information, see chapter 1 of this book and [12]. Let $\Theta = \{\theta_{\text{bare soil}}, \theta_{\text{covered soil}}\}$ be the simplest frame of discernment, involving only two elementary hypotheses with no additional assumptions on $\theta_{\text{bare soil}}$ and $\theta_{\text{covered soil}}$. The DSm theory deals with new basic belief assignments $m(\cdot) \in [0, 1]$ that accept the possibility of paradoxical information, such that:

$$m(\theta_{\text{bare soil}}) + m(\theta_{\text{covered soil}}) + m(\theta_{\text{bare soil}} \cup \theta_{\text{covered soil}}) + m(\theta_{\text{bare soil}} \cap \theta_{\text{covered soil}}) = 1.$$


Recently, a hybrid rule of combination derived from the DSm theory has been developed by the authors of the theory, see chapter 4 of this book. The fusion of paradoxical and uncertain evidence with the hybrid DSm rule combines several masses from independent sources of information and takes the dynamics of the data sets into consideration. The hybrid DSm model can thus be considered as an intermediate model between the DS and the DSm theories. Its capacity to deal with several hyper-power sets makes the hybrid model an interesting alternative in various fusion problems.
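To make the role of the paradoxical element concrete, here is a small sketch of the classic DSm rule on the free model for this two-hypothesis frame, where the fused mass of an element C is the sum of m1(A) m2(B) over all pairs with A ∩ B = C. It is not the hybrid rule with integrity constraints. Encoding each proposition as a set of Venn regions is an implementation choice made for this example, and the masses are hypothetical.

```python
from itertools import product

# Venn regions of the free model: the two hypotheses are allowed to overlap.
ONLY_BARE, ONLY_COVERED, BOTH = "only_bare", "only_covered", "both"

BARE = frozenset({ONLY_BARE, BOTH})          # theta_bare soil
COVERED = frozenset({ONLY_COVERED, BOTH})    # theta_covered soil
PARADOX = BARE & COVERED                     # theta_bare soil ∩ theta_covered soil
UNCERTAIN = BARE | COVERED                   # theta_bare soil ∪ theta_covered soil

def classic_dsm(m1, m2):
    """Classic DSm rule: m(C) = sum of m1(A) * m2(B) over A ∩ B = C.
    No normalisation is needed because no intersection is empty in the free model."""
    fused = {}
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        c = a & b
        fused[c] = fused.get(c, 0.0) + mass_a * mass_b
    return fused

# Hypothetical masses for two conflicting sources.
m1 = {BARE: 0.7, COVERED: 0.2, UNCERTAIN: 0.1}
m2 = {BARE: 0.1, COVERED: 0.8, UNCERTAIN: 0.1}

fused = classic_dsm(m1, m2)
for name, element in [("bare", BARE), ("covered", COVERED),
                      ("bare ∪ covered", UNCERTAIN), ("bare ∩ covered", PARADOX)]:
    print(name, round(fused.get(element, 0.0), 4))
```

The mass that Dempster's rule would count as conflict and normalise away ends up on the paradoxical element bare ∩ covered.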


17.4.1 Mass belief assignment 
17.4.1.1 Fuzzy mass belief assignment 


The mass belief assignment follows the same process as for the DS theory. Nevertheless, a fuzzy mass belief assignment is applied here to two sources of evidence, “size of fields” and “distance from farm buildings”, because of their specific characteristics (Fig. 17.1). For the variable “Size of fields”, for example, the size ranges from 0.05 to 7.7 ha. A continuous mass belief assignment therefore appears pertinent for the fusion process: it integrates paradoxical information where the experts had introduced a threshold instead. It is achieved by smoothing the actual bi-level assignment (Fig. 17.2).

Figure 17.2: Fuzzy mass belief assignment for the evidences “Distance” and “Field size”.


17.4.1.2 Contextual damping of source of evidence 


Since the conflict level between sources is not necessarily involved in the mispredictions for the “bare soil” hypothesis, a contextual damping strategy is applied, depending on the decision that is about to be taken. We consider that when the decision is about to be taken in favour of the “bare soil” hypothesis, distance to farm and field size are completely paradoxical when the crop rotation belongs to class 1 or 2. Furthermore, when the decision is about to be taken in favour of the “covered soil” hypothesis, all the sources become paradoxical when the crop rotation class is 3 or higher (bare soil during at least two winters).


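In order to make the sources of evidence paradoxical, a partial damping is applied as follows:

$$\begin{aligned}
m'(\theta_{\text{bare soil}}) &= \alpha\, m(\theta_{\text{bare soil}}) \\
m'(\theta_{\text{covered soil}}) &= \beta\, m(\theta_{\text{covered soil}}) \\
m'(\theta_{\text{bare soil}} \cup \theta_{\text{covered soil}}) &= m(\theta_{\text{bare soil}} \cup \theta_{\text{covered soil}}) \\
m'(\theta_{\text{bare soil}} \cap \theta_{\text{covered soil}}) &= 1 - \alpha\, m(\theta_{\text{bare soil}}) - \beta\, m(\theta_{\text{covered soil}}) - m(\theta_{\text{bare soil}} \cup \theta_{\text{covered soil}}).
\end{aligned}$$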


The couple (α, β) makes it possible to transfer the mass of a hypothesis to the paradoxical element. Here, (α, β) = (0.1, 1) is applied when the decision “bare soil” is about to be taken with a crop rotation of class 1 or 2 (bare soil during no more than one winter). Likewise, (α, β) is set to (1, 0.1) when deciding “covered soil” while the crop rotation class is 3 or higher (bare soil during at least two winters).

This contextual partial damping allows the DSm rule to take into consideration a kind of conditional mass assignment.
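The partial damping can be sketched as below. The dictionary keys, the function names and the example masses are illustrative assumptions. The crop rotation class follows the case numbering 1 to 6 of Section 17.2.2, and the "covered soil" condition is read as class 3 or higher, in line with the parenthetical "bare soil during 2 winters at least".

```python
def partial_damping(m, alpha, beta):
    """Scale the masses of the two hypotheses and move the removed mass
    to the paradoxical element (bare ∩ covered); the union is unchanged."""
    damped = {
        "bare": alpha * m["bare"],
        "covered": beta * m["covered"],
        "union": m["union"],
    }
    damped["intersection"] = 1.0 - damped["bare"] - damped["covered"] - damped["union"]
    return damped

def contextual_coefficients(pending_decision, crop_class):
    """Return (alpha, beta) for the decision about to be taken.
    crop_class: 1 = covered every winter, 2 = bare one winter, 3 = bare two winters, ..."""
    if pending_decision == "bare" and crop_class in (1, 2):
        return 0.1, 1.0
    if pending_decision == "covered" and crop_class >= 3:
        return 1.0, 0.1
    return 1.0, 1.0  # no damping outside the two contexts described in the text

# Hypothetical mass of the "field size" source for a field in crop rotation class 2.
m_field_size = {"bare": 0.6, "covered": 0.3, "union": 0.1}
alpha, beta = contextual_coefficients("bare", crop_class=2)
print(partial_damping(m_field_size, alpha, beta))
```

In this example most of the "bare soil" mass is moved to the paradoxical element, so the source no longer pushes the fused decision towards "bare soil" in that context.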


17.4.2 Results 


The application of a contextual DSm rule of combination provides better results for the hypothesis “bare soil”: 121 fields (Table 17.3) are correctly predicted, against 73 with the DS theory and 84 with the source weakness process. The “bare soil” hypothesis still generates a high level of mispredictions, which is not the case for the “covered soil” hypothesis. Several factors can explain the low rate of correct predictions for the hypothesis “bare soil”. It is strongly linked to the high spatio-temporal variability of the land use. Indeed, a significant number of fields covered with meadows during four or five years are ploughed in autumn and re-integrated in a cycle of crop successions. This kind of change is difficult to model since it can be due to unexpected individual human decisions, or to exceptional and isolated weather events. The spatial distribution of the results can be analyzed in Fig. 17.3. The west part of the watershed corresponds to a more intensive farming system than the east part. In the context of an intensive system, the variability of land cover changes is higher than in the other systems; it depends mostly on economic constraints that are difficult to model. On the contrary, the south part of the watershed is characterized by a dairy production system. In this part of the watershed, the land cover evolution is better known and depends strongly on the crop successions. Its integration into the DSm theory is easier and the prediction process yields better results.

Table 17.3: Performance of hybrid DSm rule for land prediction

Figure 17.3: Prediction performance with the hybrid DSm rule on the Yar watershed (Brittany).

17.5 Conclusion


Two studies have been analyzed in this chapter for the prediction of land cover on a watershed subject to environmental problems. Land cover prediction with the DS theory proved to be limited by the increase of conflict between the sources of evidence that support the land cover hypotheses. Several modifications may be used, such as source weighting or the Hedging methods, but they brought no benefit in our case. To manage the conflict, the DSm theory has been applied, with a small improvement in the accuracy of predictions. In fact, conflict may not explain all the bad decisions by itself, since the sources of evidence may all together induce a wrong decision. That is why a contextual fusion rule appeared necessary for this environmental problem, where information sources can be paradoxical and/or uncertain. This new fusion process first required the identification of the driving factors of land cover change. A mass belief assignment was then built for the two hypotheses “covered soil” and “bare soil” through expert knowledge and a mutual information analysis that yields a hierarchization of the sources of evidence. A fuzzy assignment is performed for two of the information sources, and a “contextual” combination rule is applied to manage the uncertainty and the paradoxical characteristics of the information sources in the DSm decision process. The results for the “bare soil” hypothesis, which still generates too many mispredictions, are better than the predictions obtained with the DS decision rule (46% of correct “bare soil” predictions against 36% in the previous study). The hypothesis “covered soil” yields 78% of correct predictions; this difference between the hypotheses can be explained by the low proportion of bare soil on the watershed and especially by the high variability of the land cover changes that characterizes the intensive farming systems located in the north-west part of the watershed. Nevertheless, the fusion process appears to be robust and does not require specific data as input. The prediction system developed with the DSm theory can thus be applied to other watersheds in Brittany and provides a useful tool for assessing and planning land use. Knowledge of land use is one of the keys to restoring water quality in intensive agricultural regions.

Chapter 18


Power and Resource Aware 


Distributed Smart Fusion 


Shubha Kadambe 
HRL Laboratories, LLC 
3011 Malibu Canyon Rd., Malibu, CA 90265, USA 


Abstract: Large distributed sensor networks (DSN) with disparate sensors, pro- 
cessors and wireless communication capabilities are being developed for a variety of 
commercial and military applications. Minimizing power consumption of the nodes 
is a critical issue to their good functioning during the mission or application, to 
reduce their size and weight, and their cost so that their deployment is economically 
viable. In this chapter, we describe a robust, flexible, and distributed smart fusion 
algorithm that provides high decision accuracy and minimizes power consumption 
through efficient use of network sensing, communication, and processing resources. 
Our approach, developed on information theory-based metrics, determines what net- 
work resources (sensors, platforms, processing, and communication) are necessary to 
accomplish mission tasks, then uses only those necessary resources. It minimizes the 
network power consumption and combines valuable information at features and deci- 
sion level using DSmT. We demonstrate the proposed optimal, fully autonomous, 
smart distributed fusion algorithm for target detection and classification using a 
DSN. Our experimental results show that our approach significantly improves the
detection and classification accuracy using the required high quality sensors and
features, and valuable fused information.

18.1 Introduction


Spatially distributed networks of inexpensive, small and smart nodes with multiple onboard sensors are an important class of emerging networked systems for various defense and commercial applications. Since such a network of sensors has to operate efficiently in adverse environments using limited battery power and resources, it is important that appropriate sensors process information hierarchically and share information only if it is valuable in terms of improving the decision accuracy, so that a highly accurate decision is reached progressively. One way to address this problem is to activate only those sensors that provide missing and relevant information; to assess the quality of the information obtained from the activated sensors (which helps in determining the sensor quality); to assess the value of the obtained information in terms of improving the decision (e.g., target detection/track) accuracy; to communicate only relevant, high quality and valuable information to the neighboring nodes; and to fuse only valuable information that aids progressive decisions. Information theoretic approaches provide measures for relevance, utility, missing information, value of information, etc. These measures help in achieving (a) hierarchical extraction of relevant, high quality information, which enables the selection/actuation of relevant sensors and the dynamic discarding of information from noisy or dead sensors, and (b) progressive improvement of decision accuracy and confidence, by utilizing only valuable information while fusing information obtained from neighboring nodes. In this chapter, we describe a minmax entropy based technique for missing information (feature) and information type (sensor) discovery, a within class entropy based technique for sensor discrimination (i.e., quality assessment), mutual information for feature quality assessment, and mutual information and other measures for assessing the value of information in terms of improvement in decision accuracy. In addition, we briefly describe how high quality, relevant and valuable information is fused using a new theory, DSmT, which provides rules for combining two or more masses from independent sources of information that change dynamically in real time, which is essential in the network of disparate sensors considered here.


To the best knowledge of this author, no study on sensor discrimination using the within class entropy metric has been reported, even though there is one study, described in [2], on using mutual information for selecting a subset of features from a bigger set. The technique described in this chapter uses within class entropy as a metric to assess the quality (good vs. bad) of a sensor. Unlike our technique, the technique in [2] is static in nature and cannot handle the case where the dimensionality of the feature set varies. In [15], the author shows that in general the performance of a network of sensors can be improved by fusing data from selected sensors. However, unlike in this chapter, no specific novel metrics for feature discovery and feature/sensor discrimination were developed in that study. In [10], techniques to represent Kalman filter state estimates in the form of information (Fisher and Shannon entropy) are provided. In such a representation it is straightforward to separate out what is new information from what is either prior knowledge or common information. This separation procedure is used in the decentralized data fusion algorithms described in [10]. However, to the best knowledge of this author, no study has been reported on using the minmax entropy principle for feature and information type discovery. Furthermore, to our knowledge the proposed value of information based fusion has not been studied by others and is another significant contribution of this chapter. In addition, the significance of this study lies in the application of feature discovery and sensor discrimination to awakening the required sensor and to forming a cluster of distributed sensors, in order to reduce the power consumption, improve the decision accuracy and reduce the communication bandwidth requirements. This chapter is a comprehensive account of our studies reported in [6] [7] [8], with the addition of the application of DSmT for fusion at both feature and decision levels.


In the next section, the proposed techniques are described. The simulation description and experimental results are provided in section 18.3. Conclusions and future research directions are provided in section 18.4.


18.2 Description of proposed research 


18.2.1 Discovery of missing information 


In applications of a distributed network of disparate sensors such as (a) target detection, identification and tracking, (b) classification, (c) coalition formation, etc., the missing information could correspond to feature discovery. This helps in probing (awakening) only the sensor node that can provide the missing information, and thus saves power and processing by not arbitrarily activating nodes and by letting the unused sensors stay in sleep mode. We apply the minmax entropy principle described in [9] for feature discovery. The details of the estimation of the missing information (in other words, feature discovery) and of the information type using the minmax entropy principle are as follows.


18.2.1.1 Minmax entropy principle 


Let N given values correspond to n different information types. Let $z_{ij}$ be the j-th member of the i-th information type (where an information type is defined as a sensor type that gives similar information measures), so that

$$j = 1, 2, \ldots, m_i, \qquad \sum_{i=1}^{n} m_i = N. \tag{18.1}$$

Then the entropy for this type of classes of information is:

$$H = -\sum_{i=1}^{n} \sum_{j=1}^{m_i} \frac{z_{ij}}{T} \ln \frac{z_{ij}}{T}, \quad \text{where} \quad T = \sum_{i=1}^{n} \sum_{j=1}^{m_i} z_{ij}. \tag{18.2}$$

Let $T_i = \sum_{j=1}^{m_i} z_{ij}$. Using this, H can be written as:

$$H = \sum_{i=1}^{n} \frac{T_i}{T} H_i - \sum_{i=1}^{n} \frac{T_i}{T} \ln \frac{T_i}{T} = H_W + H_B, \tag{18.3}$$

where $H_i = -\sum_{j=1}^{m_i} \frac{z_{ij}}{T_i} \ln \frac{z_{ij}}{T_i}$ is the entropy of the values that belong to information type i.
j=l 


In the equation above, $H_W$ and $H_B$ are the entropies within classes (information types) and between classes, respectively. We would like the types of information to be as distinguishable as possible and the information within each type to be as homogeneous as possible. The entropy is high if the values belonging to a type (class) represent similar information and low if they represent dissimilar information. Therefore, we would like $H_B$ to be as small as possible and $H_W$ as large as possible. This is the principle of minmax entropy.


18.2.1.2 Application of minimax entropy principle for feature discovery 


Let z be the missing value (feature). Let T be the total of all known values, so that the total of all values is T + z. Let $T_1$ be the total of the known values that belong to the information type to which z may belong; $T_1 + z$ is then the total for that particular type of information. This leads to:

$$\begin{aligned}
H &= -{\sum}' \frac{z_{ij}}{T+z} \ln \frac{z_{ij}}{T+z} - \frac{z}{T+z} \ln \frac{z}{T+z}, \\
H_B &= -{\sum}'' \frac{T_i}{T+z} \ln \frac{T_i}{T+z} - \frac{T_1+z}{T+z} \ln \frac{T_1+z}{T+z}.
\end{aligned} \tag{18.4}$$

Here $\sum'$ denotes the summation over all values of i, j except the one that corresponds to the missing information, and $\sum''$ denotes the summation over all values of i except for the type to which the missing information belongs.


We can then estimate z by minimizing $H_B/H_W$, $H_B/(H - H_B)$ or $H_B/H$, or by maximizing $(H - H_B)/H_B$ or $H/H_B$. The estimates of z provide the missing information values (features) and the information (sensor) type. From the above discussion, we can see that we are able to discover features as well as the type of sensor from which these features can be obtained. This has the advantage of probing the appropriate sensor in a DSN. The transfer of information and the probing can be achieved in such a network by using network routing techniques. Before using the newly acquired feature set from the estimated information type (i.e., sensor), it is advisable to check the quality of the sensor, to make sure that the sensor from which we are seeking the information is not noisy (not functioning properly) or “dead”; this reduces the cost of processing. In a DSN it has the added advantage of reducing the communication cost. We measure the quality (i.e. discriminate a good sensor from a bad sensor) by using an information theoretic measure, the within class entropy (see next section).
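The estimation step can be illustrated with a small grid search. This is a sketch only: it assumes strictly positive values, uses the criterion $H_B/H_W$ from the list above, and searches a user-supplied list of candidate values instead of solving the minimization analytically. The function names and the data are hypothetical.

```python
import math

def total_entropy(groups):
    """H over all values z_ij, each normalised by the grand total T (eq. 18.2)."""
    T = sum(sum(g) for g in groups)
    return -sum((v / T) * math.log(v / T) for g in groups for v in g)

def between_entropy(groups):
    """H_B = -sum (T_i / T) ln (T_i / T) over the information types."""
    T = sum(sum(g) for g in groups)
    return -sum((sum(g) / T) * math.log(sum(g) / T) for g in groups)

def within_entropy(groups):
    """H_W = sum (T_i / T) H_i, with H_i the entropy inside type i."""
    T = sum(sum(g) for g in groups)
    h_w = 0.0
    for g in groups:
        T_i = sum(g)
        H_i = -sum((v / T_i) * math.log(v / T_i) for v in g)
        h_w += (T_i / T) * H_i
    return h_w

def ratio_with_candidate(groups, z, target):
    """H_B / H_W after adding the candidate value z to information type `target`."""
    trial = [list(g) + ([z] if i == target else []) for i, g in enumerate(groups)]
    h_w = within_entropy(trial)
    return between_entropy(trial) / h_w if h_w > 0 else math.inf

def discover_feature(groups, candidates):
    """Return the (value, type index) pair that minimises H_B / H_W."""
    scored = [(ratio_with_candidate(groups, z, i), z, i)
              for i in range(len(groups)) for z in candidates]
    _, z, i = min(scored)
    return z, i

# Hypothetical known values for two information (sensor) types.
known = [[1.0, 1.2, 0.9], [5.0, 5.5]]
print(discover_feature(known, candidates=[0.5, 1.0, 2.0, 5.0]))
```

The returned type index indicates which sensor type would be probed for the missing feature.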
18.2.2 Measure of consistency


We measure relevance by measuring consistency. For this we have developed a metric based on within class entropy, which is described in this section. Let there be N events (values) that can be classified into m classes, and let an event $x_{ij}$ be the j-th member of the i-th class, where $i = 1, 2, \ldots, m$, $j = 1, 2, \ldots, n_i$ and $\sum_{i=1}^{m} n_i = N$. The entropy for this classification is:

$$\begin{aligned}
H &= \sum_{i=1}^{m} \sum_{j=1}^{n_i} p(i)\, p(x_{ij}) \log \frac{1}{p(i)\, p(x_{ij})} \\
  &= -\sum_{i=1}^{m} \sum_{j=1}^{n_i} p(i)\, p(x_{ij}) \log\big(p(i)\, p(x_{ij})\big) \\
  &= -\sum_{i=1}^{m} p(i) \sum_{j=1}^{n_i} p(x_{ij}) \log p(x_{ij}) - \sum_{i=1}^{m} p(i) \log p(i) \sum_{j=1}^{n_i} p(x_{ij}) \\
  &= \sum_{i=1}^{m} p(i)\, H_i - \sum_{i=1}^{m} p(i) \log p(i) \\
  &= H_W + H_B
\end{aligned}$$

The penultimate equality comes from the definition of $H_i = -\sum_{j=1}^{n_i} p(x_{ij}) \log p(x_{ij})$, representing the entropy of class i, and from the total probability theorem, i.e. $\sum_{j=1}^{n_i} p(x_{ij}) = 1$. $H_W$ is called the entropy within classes and $H_B$ the entropy between classes.


The entropy $H_W$ is high if the values or events belonging to a class represent similar information and is low if they represent dissimilar information. This means $H_W$ can be used as a measure to define consistency. That is, if two or more sensor measurements are similar then their $H_W$ is greater than if they are dissimilar. Therefore, this measure can be used in sensor discrimination. Note that even though the definitions of within class and between class entropy here are slightly different from those of section 18.2.1, they are similar in concept. Note also that the minmax entropy measure, which uses both within and between class entropies, was used earlier in the estimation of missing information; here, within class entropy is defined as a consistency measure that can be used in sensor discrimination or selection. These two metrics have different physical interpretations and are used for different purposes.
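A short sketch of the consistency measure, assuming that the class priors p(i) and the within-class distributions p(x_ij) are already available as normalised lists (how raw sensor readings are turned into these probabilities is not specified here, so the numbers below are purely illustrative):

```python
import math

def distribution_entropy(p):
    """-sum p log p over a discrete distribution, ignoring zero entries."""
    return -sum(v * math.log(v) for v in p if v > 0)

def within_class_entropy(priors, conditionals):
    """H_W = sum_i p(i) H_i, with H_i the entropy of the values inside class i."""
    return sum(p_i * distribution_entropy(c) for p_i, c in zip(priors, conditionals))

def between_class_entropy(priors):
    """H_B = -sum_i p(i) log p(i)."""
    return distribution_entropy(priors)

priors = [0.5, 0.5]
# Class 0 carries similar (evenly spread) values in the first case,
# and dissimilar (one dominant) values in the second case.
consistent = [[0.25, 0.25, 0.25, 0.25], [0.25, 0.25, 0.25, 0.25]]
inconsistent = [[0.85, 0.05, 0.05, 0.05], [0.25, 0.25, 0.25, 0.25]]

print(within_class_entropy(priors, consistent))    # larger H_W: consistent values
print(within_class_entropy(priors, inconsistent))  # smaller H_W: dissimilar values
```

Comparing $H_W$ across candidate sensors then gives a simple ranking for sensor discrimination.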


18.2.3 Feature discrimination 


After making sure of the quality of the sensor (the information type) from which the missing information can be obtained, it is necessary to make sure that the observations (features) from that sensor do help in gaining information as far as the required decision is concerned. This step double-checks that the estimated missing information is indeed needed. For this, we have developed metrics based on conditional entropy and mutual information, which are described in the following two subsections.

18.2.3.1 Conditional entropy and mutual information


Entropy is a measure of uncertainty. Let H(x) be the entropy of the previously observed events x, and let y be a new event. We can measure the uncertainty of x after including y by using the conditional entropy, which is defined as:

$$H(x|y) = H(x, y) - H(y) \tag{18.5}$$

with the property $0 \le H(x|y) \le H(x)$. The conditional entropy H(x|y) represents the amount of uncertainty remaining about x after y has been observed. If the uncertainty is reduced, then information has been gained by observing y. Therefore, we can measure the importance of observing the estimated y by using the conditional entropy. Another measure, related to the conditional entropy, is the mutual information I(x, y), which measures the uncertainty that is resolved by observing y and is defined as:

$$I(x, y) = H(x) - H(x|y) \tag{18.6}$$

To explain how this measure can be used to assess the importance of the estimated missing information (e.g., features), which is referred to as feature discrimination, an example is provided below.


18.2.3.2 Example of feature discrimination based on entropy metrics 


Let $A = \{a_k\}$, $k = 1, 2, \ldots$ be the set of features from sensor 1 and let $B = \{b_l\}$, $l = 1, 2, \ldots$ be the set of features from sensor 2. Let $p(a_k)$ be the probability of feature $a_k$ and $p(b_l)$ the probability of feature $b_l$. Let H(A), H(B) and H(A|B) be the entropies corresponding to sensor 1, sensor 2 and sensor 1 given sensor 2, respectively; they are defined as [9]:

$$\begin{aligned}
H(A) &= \sum_{k} p(a_k) \log \frac{1}{p(a_k)} \\
H(B) &= \sum_{l} p(b_l) \log \frac{1}{p(b_l)} \\
H(A|B) &= H(A, B) - H(B) = \sum_{l} p(b_l)\, H(A|b_l) = \sum_{l} p(b_l) \sum_{k} p(a_k|b_l) \log \frac{1}{p(a_k|b_l)}
\end{aligned} \tag{18.7}$$

Here, the entropy H(A) corresponds to the prior uncertainty and the conditional entropy H(A|B) corresponds to the amount of uncertainty remaining after observing the features from sensor 2. The mutual information I(A, B) = H(A) − H(A|B) corresponds to the uncertainty that is resolved by observing B, in other words the features from sensor 2. From the definition of mutual information, it can be seen that the uncertainty that is resolved depends essentially on the conditional entropy. Let us consider two types of sensors at node 2. Let the sets of features of these two sensors be $B_1$ and $B_2$, respectively, and let the set of features estimated by the minmax entropy principle described in the previous section be $B_1$. If $H(A|B_1) < H(A|B_2)$ then $I(A, B_1) > I(A, B_2)$. This implies that the uncertainty is better resolved by observing $B_1$ than by observing $B_2$. It further implies that the estimated $B_1$ indeed corresponds to features that help in gaining information that aids the decision process of sensor 1, whereas $B_2$ does not and hence should not be considered.

Note that even though only two sensor nodes are considered in the above example for simplicity, this measure or metric can be used in a network of more than two sensors. In such a case, A would be the set of features that a node already has from the other sensors of a cluster it is a member of, and B would be a new feature set that it receives from a different sensor type from which it has not yet received features; that sensor may or may not be a member of that particular cluster. If the mutual information increases by including the set of features B, then we decide to include that sensor in this particular cluster if it is not already a member. If it is a member and the mutual information does not increase, then it is discarded from that particular cluster.
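The comparison between two candidate feature sets can be sketched from joint probability tables. The sketch assumes each table is given as `joint[k][l] = p(a_k, b_l)` and sums to one; the two tables below are hypothetical and only illustrate an informative sensor versus an uninformative one.

```python
import math

def entropy(p):
    """-sum p log p over a discrete distribution, ignoring zero entries."""
    return -sum(v * math.log(v) for v in p if v > 0)

def conditional_entropy(joint):
    """H(A|B) = H(A, B) - H(B), with joint[k][l] = p(a_k, b_l)."""
    h_joint = entropy([v for row in joint for v in row])
    p_b = [sum(col) for col in zip(*joint)]
    return h_joint - entropy(p_b)

def mutual_information(joint):
    """I(A, B) = H(A) - H(A|B)."""
    p_a = [sum(row) for row in joint]
    return entropy(p_a) - conditional_entropy(joint)

# Hypothetical joint tables of sensor-1 features with two candidate sensors.
joint_with_b1 = [[0.45, 0.05],
                 [0.05, 0.45]]   # B1 is strongly related to A
joint_with_b2 = [[0.25, 0.25],
                 [0.25, 0.25]]   # B2 is independent of A

for name, joint in [("B1", joint_with_b1), ("B2", joint_with_b2)]:
    print(name, conditional_entropy(joint), mutual_information(joint))

# Keep the feature set that resolves more uncertainty about A.
keep = "B1" if mutual_information(joint_with_b1) > mutual_information(joint_with_b2) else "B2"
print("selected:", keep)
```

The same comparison applies to cluster membership: a sensor whose features do not increase the mutual information is a candidate for removal from the cluster.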


18.2.4 Measures of value of information 


This section describes the measures of value of information that we have developed to determine when to 
fuse information from disparate sources. The value is in terms of improving the decision accuracy. Even 
though the mathematics of the metrics described below are not novel, the usage of metrics in the context 
of verifying value of information with respect to improving the decision accuracy (e.g., classification 


accuracy, detection accuracy) is new. 


18.2.4.1 Mutual information 


Mutual information defined in section 18.2.3.1 can also be used as a measure of value.


18.2.4.2 Euclidean Distance 


Unlike mutual information, Euclidean distance does not evaluate the amount of information available
from a second source. It does, however, measure the similarity between two feature sets in Euclidean
space. This value can then be used to determine when to fuse two sources of information, whether they
are from different types of sensors on the same node or from the same type of sensors on different
nodes. A simple measure, Euclidean distance is defined as:

$$d = \sqrt{\sum_i (a_i - b_i)^2} \qquad (18.8)$$

where $a_i$, $b_i$ and $i$ are defined in Section 18.2.3.1.
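The distance (18.8) can be sketched in a few lines of Python. This is only an illustration: the function names and the `threshold` parameter are assumptions introduced here, not part of the original study.

```python
import math


def euclidean_distance(a, b):
    """Euclidean distance (18.8) between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature sets must have the same length")
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def should_fuse(a, b, threshold):
    """Hypothetical rule: fuse when the feature sets are no farther apart than `threshold`."""
    return euclidean_distance(a, b) <= threshold
```

A small distance indicates similar feature sets; how the threshold is chosen is left open in this sketch.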


18.2.4.3 Correlation 


Correlation is also a well-known measure of similarity. We use the standard measure of correlation as
defined by:

$$\rho = \frac{E[(a - \mu_a)(b - \mu_b)]}{\sqrt{E[(a - \mu_a)^2]\, E[(b - \mu_b)^2]}} \qquad (18.9)$$

where $\mu_a$ and $\mu_b$ are the means of feature sets $a$ and $b$, respectively. Note that correlation
is very closely related to mutual information, $I(x, y)$, because (18.6) can be rewritten as:

$$I(x, y) = \sum_k p(a_k, b_k) \log \frac{p(a_k, b_k)}{p(a_k)\, p(b_k)} \qquad (18.10)$$
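As a minimal sketch of the correlation measure (18.9), assuming the expectations are replaced by sample averages over two equal-length feature vectors (the function name is illustrative only):

```python
import math


def correlation(a, b):
    """Sample version of the standard correlation coefficient of two feature vectors."""
    if len(a) != len(b) or not a:
        raise ValueError("feature sets must be non-empty and of equal length")
    n = len(a)
    mu_a = sum(a) / n
    mu_b = sum(b) / n
    cov = sum((x - mu_a) * (y - mu_b) for x, y in zip(a, b)) / n
    var_a = sum((x - mu_a) ** 2 for x in a) / n
    var_b = sum((y - mu_b) ** 2 for y in b) / n
    if var_a == 0.0 or var_b == 0.0:
        raise ValueError("correlation is undefined for a constant feature set")
    return cov / math.sqrt(var_a * var_b)
```

Values near +1 or -1 indicate strongly linearly related feature sets, and values near 0 indicate little linear relation.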


18.2.4.4 Kullback-Leibler distance


Finally, the Kullback-Leibler (KL) distance is derived from entropy, and again is a measure of the
separation of two feature sets. It is defined as:

$$D = \sum_k p(a_k) \log \frac{p(a_k)}{p(b_k)} + \sum_k p(b_k) \log \frac{p(b_k)}{p(a_k)} \qquad (18.11)$$
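The symmetric form of the KL distance in (18.11) can be sketched as below. The probabilities are assumed to come from histograms over the same bins; flooring them at a small `eps` to avoid taking the logarithm of zero is an assumption of this sketch, not something stated in the text.

```python
import math


def symmetric_kl(p, q, eps=1e-12):
    """Symmetric KL distance: sum_k p_k log(p_k/q_k) + sum_k q_k log(q_k/p_k)."""
    if len(p) != len(q):
        raise ValueError("distributions must be defined over the same bins")
    total = 0.0
    for pk, qk in zip(p, q):
        pk = max(pk, eps)  # assumed guard against empty histogram bins
        qk = max(qk, eps)
        total += pk * math.log(pk / qk) + qk * math.log(qk / pk)
    return total
```

The result is zero for identical distributions and grows as the two feature distributions separate.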


18.2.5 Fusion using DSmT 


Since in a network of disparate sensor nodes, as considered here, the sources of information are
independent and change dynamically based on which sensors and features are selected, for the smart
distributed fusion we use the new theory of plausible and paradoxical reasoning, DSmT, developed in [5].
This theory provides a hybrid DSm rule which combines or fuses two or more masses of independent sources
of information and takes care of constraints, i.e., of sets which might become empty at a certain time
or new sets that might arise at some other time. In a network of sensor nodes these situations arise
(sometimes we discard the feature set or decision from the other nodes and sometimes we use features
from different types of sensors, based on how the scene is changing dynamically) and hence the
application of the hybrid DSm rule for fusion is very appropriate. In addition, since fusion is not done
at a centralized location but locally and dynamically, based on the information received from the
neighboring nodes, we propose to extend the decentralized dynamical fusion by combining dynamical fusion
using the hybrid DSm rule for the chosen hybrid model M. Specifically, at the feature level fusion at
each sensor node the frame under consideration at time $t_l$ will be

$$\Theta(t_l) = \{\theta_1 = \text{acoustic sensor},\ \theta_2 = \text{seismic sensor},\ \theta_3 = \text{IR sensor location}\}$$

and at decision level fusion, $\Theta(t_l) = \{\theta_1 = \text{vehicle present},\ \theta_2 = \text{vehicle not present}\}$
in the case of a detection application and

$$\Theta(t_l) = \{\theta_1 = \text{AAV},\ \theta_2 = \text{DW},\ \theta_3 = \text{HMMWV}\}$$

(where AAV, DW, and HMMWV represent the vehicle types that are being classified) for the decision
level fusion in the case of a classification application.


Both detection and classification applications are described in section 18.3.2. We derive basic belief
assignments based on the observations (a) from the sensor type for feature level fusion, and (b) from
the features extracted from the sensors' signals for fusion at the decision level in the case of
classification and detection applications. For example, $m_1(\theta_1) = 0$ and $m_1(\theta_2) = 0$, if
the feature from the acoustic sensor (a), energy, is well above the threshold level in the case of the
detection application. $\Theta(t_l)$ changes as the observation changes with the sensor and feature
selection described above, and results in $\Theta(t_{l+1})$. If we discard observations from a sensor
(based on the feature discrimination algorithm explained above) then $\Theta$ diminishes and we apply
the hybrid DSm rule to transfer the masses of empty sets to non-empty sets. If we include observations
from a new sensor then we use the classical DSm fusion rule to generate basic belief assignments
$m_{t_{l+1}}(\cdot)$. For the decentralized decision level fusion at the current node, consider the
frame obtained from the previous node and the $\Theta(t_l)$ of the current node, and apply the hybrid
DSm rule, taking the integrity constraints into consideration. These constraints are generated
differently for the fusion between the sensors and for the fusion from node to node. The pseudo-codes
which generate these constraints are given in section 18.3.2. For example, in the case of node to node
fusion for the classification application described in section 18.3.2.2.1, fuse_4class = 1 will indicate
to put the constraints $\theta_1 \cap \theta_2 \overset{\mathcal{M}}{\equiv} \emptyset$,
$\theta_1 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$,
$\theta_1 \cap \theta_2 \cap \theta_3 \overset{\mathcal{M}}{\equiv} \emptyset$ at the current node if
the classification at the previous node corresponds to $\theta_1$ = AAV, since if the vehicle at the
previous node is AAV, the vehicle at the current node, which is very close to the previous node, has to
be AAV.
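To make the role of the integrity constraints concrete, the sketch below combines two basic belief assignments in a deliberately simplified special case: all hypotheses are assumed mutually exclusive, and the inputs carry mass only on hypotheses and their unions. Under these assumptions the mass of every empty intersection is transferred to the union of the two sets involved. The general hybrid DSm rule on the hyper-power set with partial constraints is not implemented here, and the function and label names are hypothetical.

```python
from itertools import product


def combine_exclusive(m1, m2):
    """Combine two bbas whose focal elements are frozensets of exclusive hypotheses.

    Non-empty intersections keep their product mass (conjunctive part);
    empty intersections pass their mass to the union of the two sets.
    """
    fused = {}
    for (x1, w1), (x2, w2) in product(m1.items(), m2.items()):
        target = x1 & x2
        if not target:          # intersection is empty under the assumed constraints
            target = x1 | x2    # transfer the conflicting mass to the union
        fused[target] = fused.get(target, 0.0) + w1 * w2
    return fused


# Detection frame: vehicle present / vehicle not present (illustrative masses).
PRESENT = frozenset({"present"})
ABSENT = frozenset({"absent"})
EITHER = PRESENT | ABSENT

previous_node = {PRESENT: 0.6, ABSENT: 0.3, EITHER: 0.1}
current_node = {PRESENT: 0.5, ABSENT: 0.2, EITHER: 0.3}
fused = combine_exclusive(previous_node, current_node)
```

In this example the conflicting products (present at one node, absent at the other) end up on the ignorance set rather than being renormalized away.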


18.3 Experimental details and results 


The algorithms described above have been applied for feature discovery, sensor and feature evaluation
(discrimination), cluster formation and distributed smart fusion in both a network of simulated radar
sensors and a network of real disparate sensors and sensor nodes that are spatially distributed. First,
in section 18.3.1, the results obtained using a simple simulated network of radar sensors are provided
for the purpose of proving the concepts. In section 18.3.2, experimental results obtained by using a
real DSN of disparate sensors are provided.


18.3.1 Simulated network of radar sensors 


This network of sensors is used for tracking multiple targets. Each sensor node has a local and a global
Kalman filter based target tracker. These target trackers estimate the target states, position and
velocity, in the Cartesian coordinate system. The local tracker uses the local radar sensor measurements
to compute the state estimates, while the global tracker fuses target states obtained from other sensors
if they are consistent and improve the accuracy of the target tracks.


For the purposes of testing the proposed algorithms of this chapter, a network of three radar sensors
and a single moving target with constant velocity are considered. Two sensors are considered good and
one bad. A sensor is defined as bad if its measurements are corrupted with high noise (for example
SNR = -6 dB) or are biased. In the first set of examples the SNR of a good sensor is set to 10 dB.


In the simulation of a biased sensor, the bias was introduced as the addition of a random number to the
true position of a target. The bias was introduced this way because the biases in azimuth and range
associated with a radar sensor translate into a measured target position that is different from the
true target position. In addition, in our simulations, we assume that the sensors are measuring the
target's position in the Cartesian coordinate system instead of the polar coordinate system. The amount
of bias was varied by multiplying the random number by a constant k, i.e.,

$$\text{measured position} = (\text{true position} + k \cdot \text{randn}) + \text{measurement noise}.$$
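The biased measurement model above can be sketched as follows. It is assumed here that "randn" denotes a draw from a standard normal distribution and that the measurement noise is zero-mean Gaussian with a standard deviation `noise_std`; both the parameter name and the Gaussian noise model are assumptions of this sketch.

```python
import random


def measure_position(true_position, k, noise_std, rng):
    """Simulated measurement: (true position + k * randn) + measurement noise."""
    bias = k * rng.gauss(0.0, 1.0)       # random bias scaled by the constant k
    noise = rng.gauss(0.0, noise_std)    # assumed Gaussian measurement noise
    return (true_position + bias) + noise


rng = random.Random(0)
unbiased = [measure_position(100.0, 0.0, 1.0, rng) for _ in range(5)]
biased = [measure_position(100.0, 2.0, 1.0, rng) for _ in range(5)]
```

Setting `k = 0` yields an unbiased sensor, while larger values of `k` correspond to the more strongly biased sensor used in the figures.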


First, the minmax entropy principle was applied to find the missing information and the appropriate
sensor was probed to obtain that information. Then the consistency measure, within class entropy, was
applied to check whether the new sensor type and the information obtained from that particular sensor
are consistent with the other sensors.


In the following two figures, within class entropy is plotted for features discovered from two unbiased
sensors and from one biased and one unbiased sensor. The measurement noise level was kept the same for
all three sensors. However, the bias k was set to 1.0 in Figure 18.1 and to 2 in Figure 18.2. The
within class entropy was computed for different iterations using the definition provided in the previous
section. The probability values needed in this computation were estimated using the histogram approach
which is elaborated below. From these two figures, it can be seen that the within class entropy of two
unbiased sensors is greater than the within class entropy of one biased and one unbiased sensor. This
indicates that the within class entropy can be used as a measure to discriminate between sensors or to
assess the quality of sensors (to select sensors).


Next, the conditional entropy and mutual information measures described in the previous section are
used to make sure the estimated features obtained from the selected sensors indeed aid in the decision
process.


For this, the target states that were estimated from the measurements of a simulated radar at each
sensor node using the local Kalman filter algorithm are used as feature sets. The estimated target
states at each sensor node were transmitted to other nodes. For this simulation, only the estimated
position was considered for simplicity.
Figure 18.1: The plot of within class entropy of sensors 1 & 2 (unbiased sensors) and of sensors 1
(unbiased) & 3 (biased). Bias constant k = 1


Figure 18.2: The plot of within class entropy of sensors 1 & 2 (unbiased sensors) and of sensors 1
(unbiased) & 3 (biased). Bias constant k = 2


We considered the estimated state vector as the feature set here. Since the goal of this simulation
is proof of concept, the feature discrimination algorithm was implemented at sensor node 1 with the
assumption that it is a good sensor. Let the state estimate outputs of this node be $A_g$. Let the state
estimate outputs of a second sensor correspond to $B_g$ and those of a third sensor correspond to $B_b$.


For the computation of entropy, the probability values are needed as seen from the equation above. 
To obtain these values, ideally, one would need probability distribution functions (pdfs). However, in 
practice it is hard to obtain closed form pdfs. In the absence of knowledge of actual pdfs it is a general 
practice to estimate them by using histograms [11]. Researchers in signal and image processing use this
technique most commonly [13]. Another practical solution to estimate the probability and conditional 
probabilities is by using the counting or frequency approach [12]. However, it is well known that the 
estimates of probabilities and conditional probabilities are more accurate if they are estimated by using 
the pdfs that are approximated from the histograms. Therefore, we use the histogram approach here. In 
order to obtain the histograms, initially, we need some data (features) to know how it is distributed. For 
this purpose, it was assumed that initially N state estimate vectors were accumulated at each sensor node 
and this accumulated vector was transmitted to other nodes. Note also that the accuracy of probability 
estimates using the histogram approach depends on the amount of accumulated (training) data. Also 
for non-stationary features, it depends on how often the histograms are updated. In practice, since the 
training data is limited we have set N to 10 in this simulation. To take care of the non-stationarity of 
the features, initially, we wait till N estimates are obtained at each node. From then on we update the 
histograms at every time instant using the new state estimate and the previous nine state estimates. At
each time instant we discard the oldest feature (oldest state estimate).
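The update scheme described above (wait for N = 10 estimates, then keep the newest estimate and the previous nine) amounts to a fixed-length sliding window. A minimal sketch, with a hypothetical class name:

```python
from collections import deque


class FeatureWindow:
    """Keeps the most recent `size` state estimates; the oldest one is dropped automatically."""

    def __init__(self, size=10):
        self._buffer = deque(maxlen=size)

    def add(self, estimate):
        """Append a new estimate, discarding the oldest one once the window is full."""
        self._buffer.append(estimate)

    def ready(self):
        """True once N estimates have been accumulated and histograms can be built."""
        return len(self._buffer) == self._buffer.maxlen

    def values(self):
        return list(self._buffer)
```

Once `ready()` returns true, the histogram can be recomputed from `values()` at every time instant.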


To get the probability of occurrence of each feature vector, first the histogram was computed. For
this, a bin count $N_{bin}$ of 5 was used. The center point of each bin was chosen based on the minimum
and maximum feature values. In this simulation the bin centers were set as:

$$\min(\text{feature values}) + (0 : N_{bin} - 1) \cdot \frac{\max(\text{feature values}) - \min(\text{feature values})}{N_{bin}} \qquad (18.12)$$


Since the histogram provides the number of elements in a given bin, it is possible to compute the
probabilities from the histogram. In particular, each probability is computed as:

$$\frac{\text{Number of elements in a particular bin}}{\text{Total number of elements}}$$

Hence, from these histograms, probabilities were computed. Similarly, the conditional probabilities
$p(A_g|B_g)$ and $p(A_g|B_b)$ were computed from the conditional histograms, and these conditional
probabilities are plotted in Figures 18.3 and 18.4, respectively.
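The histogram-based probability estimate can be sketched as below. The bin positions follow (18.12); assigning each value to the nearest bin center is an assumption of this sketch, since the text does not state the assignment rule, and the function names are illustrative.

```python
def bin_centers(values, n_bins=5):
    """Bin positions from (18.12): min + (0 : n_bins - 1) * (max - min) / n_bins."""
    lo, hi = min(values), max(values)
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins)]


def histogram_probabilities(values, n_bins=5):
    """Probability of each bin = number of elements in the bin / total number of elements."""
    centers = bin_centers(values, n_bins)
    counts = [0] * n_bins
    for v in values:
        # Assumed rule: each value goes to the nearest center (ties go to the lower bin).
        nearest = min(range(n_bins), key=lambda i: abs(v - centers[i]))
        counts[nearest] += 1
    total = len(values)
    return [c / total for c in counts]
```

With N = 10 accumulated estimates and 5 bins, each probability is a multiple of 0.1, which illustrates why the accuracy of the estimates depends on the amount of accumulated data.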


Figure 18.3: Conditional probability of position estimates of sensor 2 at node 2 given position estimates
of sensor 1 at node 1


Figure 18.4: Conditional probability of position estimates of sensor 3 at node 3 given position estimates
of sensor 1 at node 1


Each colored line in these two plots represents one conditional probability distribution function. Note
that both A and B are vectors and there would be one pdf for each member of set A. Since we have chosen
5 bins, there are 5 members in set A and hence there are 5 subplots in Figures 18.3 and 18.4.


Using these probabilities, the conditional entropies $H(A_g|B_g)$ and $H(A_g|B_b)$, and the mutual
information $I(A_g, B_g)$ and $I(A_g, B_b)$ were computed using the equations mentioned above for one
set of features from the sensors at node 2 and node 3. After this initial computation of probabilities,
conditional entropy and mutual information, whenever a sensor estimates a new feature, it replaces the
oldest feature in the feature set and the updated set is transmitted to other sensors. Subsequently,
histograms, probabilities, conditional entropy and mutual information were computed using this updated
feature set. This takes care of the non-stationarities of the features. Thus each new feature can be
verified to make sure it is relevant in terms of aiding the decision process (e.g., track accuracy) and
that it is obtained from a good sensor. Therefore, this technique is dynamic in nature.
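A minimal sketch of the conditional entropy and mutual information computation is given below. It assumes the two feature sets have already been quantized into paired bin indices (one pair per time instant), estimates the joint probabilities by counting, and uses base-2 logarithms; these choices and all names are assumptions of the sketch.

```python
import math
from collections import Counter


def joint_probabilities(a_bins, b_bins):
    """Joint probabilities p(a, b) estimated by counting paired bin indices."""
    n = len(a_bins)
    return {pair: c / n for pair, c in Counter(zip(a_bins, b_bins)).items()}


def marginal(joint, axis):
    """Marginal probabilities of the first (axis=0) or second (axis=1) variable."""
    out = {}
    for pair, p in joint.items():
        out[pair[axis]] = out.get(pair[axis], 0.0) + p
    return out


def conditional_entropy(a_bins, b_bins):
    """H(A|B) = -sum p(a, b) log2 p(a|b)."""
    joint = joint_probabilities(a_bins, b_bins)
    p_b = marginal(joint, 1)
    return -sum(p * math.log2(p / p_b[b]) for (a, b), p in joint.items())


def mutual_information(a_bins, b_bins):
    """I(A, B) = sum p(a, b) log2 [p(a, b) / (p(a) p(b))]."""
    joint = joint_probabilities(a_bins, b_bins)
    p_a, p_b = marginal(joint, 0), marginal(joint, 1)
    return sum(p * math.log2(p / (p_a[a] * p_b[b])) for (a, b), p in joint.items())
```

A node would then prefer the sensor whose feature set yields the larger mutual information with its own features, i.e., it would fuse with the second sensor when `mutual_information(a, b_second) > mutual_information(a, b_third)`.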


18.3.1.1 Versatility of the algorithm 


To verify the versatility of this algorithm we considered two different feature sets, namely the sensor
measurements themselves instead of the position estimates, and the first difference in position
estimates. We performed simulations similar to those described above using these two types of feature
sets and the associated histograms for the probability, entropy and mutual information computations. In
these two cases also, we always obtained $I(A_g, B_g) > I(A_g, B_b)$ for all 100 runs of the Monte Carlo
simulations.


18.3.1.2 Sensitivity of the algorithm for sensor discrimination 


Next, the noise levels at sensors 2 and 3 were varied to determine the sensitivity of the sensor
discrimination algorithm. The SNR at sensor 1 was fixed at 10 dB. The algorithm was able to discriminate
between the good and bad sensors 100 % of the time when the noise level at sensor 2 is 8 dB and at
sensor 3 is 3 dB. The algorithm was able to discriminate about 80 % of the time if the noise level at
sensor 3 is 5 dB when the noise level at sensor 2 is fixed at 8 dB. If the noise level at both sensors 1
and 2 is 10 dB then the algorithm was able to discriminate 100 % of the time when the noise level at
sensor 3 is 5 dB. However, when the noise level at sensor 3 was changed to 7 dB, the percentage of
correct discrimination dropped to 82 %. Therefore, if the minimum difference between the noise levels at
sensors 2 and 3 is 5 dB then the discrimination accuracy is 100 %. If the noise levels at sensors 2 and
3 are close (a difference of 1 dB) then, as expected, the algorithm cannot discriminate.




18.3.1.3 Mutual information versus track accuracy 


To check whether the mutual information metric, when used to evaluate the information gain from
observing the estimated missing features (information), indeed aids in improving the decision accuracy
(e.g., track accuracy), the following experiment was conducted.


As before, the mutual information $I(A_g, B_g)$ and $I(A_g, B_b)$ was computed using measurements as
the feature set. If $I(A_g, B_g) > I(A_g, B_b)$ then the state estimates from the good sensor were fused
with those of sensor 1 using the global Kalman filter algorithm and the DSm combination rule described
in section 18.2.5. The position estimation error was computed by comparing the fused state estimate with
the true position. To compare the track accuracies, the state estimates from the bad sensor and the good
sensor were also fused. The position estimation error was then computed in the same way as explained
above.


In Figure 18.5, the position estimation errors using the fused state estimates of sensor 1 & a good
sensor (blue plot) and of sensor 1 & a bad sensor (red plot) are plotted. From this figure, it can be
seen that the track accuracy after fusing state estimates from good sensors (1 & 2) is much better than
after fusing state estimates from a good sensor and a bad sensor (1 & 3). This implies that better
mutual information correlates with better track accuracy.


In Figure 18.6, the position error is plotted for the case when the noise levels at sensors 2 and 3
differ by 5 dB. In this case also, it can be seen that the track accuracy is better when the state
estimates from good sensors are fused, as compared to the track accuracy of the fused state estimates of
a good sensor and a bad sensor.


We then form a cluster of sensors that are consistent and apply the mutual information metric.
We have shown above that by fusing information from sensors when the mutual information increases,
the decision accuracy improves. We transmit the fused decision (which requires much lower bandwidth
compared to the transmission of the decision of each sensor to every other sensor in the network) to
other clusters of sensors and thus reduce the communication bandwidth requirement.


Figure 18.5: Track accuracy comparison - Noise level at sensor 1 and 2 = 10 dB and at sensor 3 = 0 dB
(curves: sensor 1 (good) & sensor 2 (good); sensor 1 (good) & sensor 3 (bad))


Figure 18.6: Track accuracy comparison - Noise level at sensor 1 and 2 = 10 dB and at sensor 3 = 0 dB
(curves: sensor 1 (good) & sensor 2 (good); sensor 1 (good) & sensor 3 (bad))


18.3.2 A real network of spatially DSN with disparate sensors


The proposed algorithms described in section 18.2 were implemented on sensor nodes that consist of
multiple sensors, a communication radio and a Sharc processor. These sensor nodes were distributed in
rough terrain such as a desert. This network was used for detecting, tracking and classifying targets.
Even though we verified the algorithms for estimating the missing information, sensor selection, and
sensor and feature assessment in this network of sensor nodes, in the following subsections we
concentrate on the value of information based smart fusion described in sections 18.2.4 and 18.2.5,
since the experimental results for the other algorithms are provided in the last section. We provide the
experimental details and the results. We begin with a review of the detection and classification
algorithms that were used in this context.


18.3.2.1 Review of algorithms used to check the value of information based smart fusion 


The metrics described in section 18.2.4 are used to measure the value of information obtained from other
sources, such as multiple sensors on a single node and neighboring nodes, in the context of target
detection and classification. For target detection, an energy based detector was used and for
classification, a maximum likelihood based classifier was used. As mentioned before, the value of
information is in terms of improvement in the decision accuracy, which corresponds to classification
accuracy for a classifier and detection accuracy or probability of detection for a detector. Note that
in this study we did not develop a classifier or a detector, but used those developed by others, since
the goal of this part of the study is to develop measures of value of information and verify them in
terms of the improvement in decision accuracy when they are used to decide whether or not to fuse
information obtained from another source. In the following two sections we review the classifier and
the detector that were used in this study.


18.3.2.1.1 Maximum likelihood based classifier


The classifier we used for the verification of measures of value of information in terms of improving
the decision accuracy is a maximum likelihood based classifier developed by the University of Wisconsin
[16] as part of DARPA's sensor information technology (SensIT) program. For given training features and
target labels, a Gaussian mixture model is determined during the training phase of the classifier.
During testing, the distance between the test feature vector and the ith class Gaussian mixture is
computed. This corresponds to the negative log likelihood. Then the a priori probability is used to
obtain the maximum a posteriori classification. The feature set used here consists of twenty features
from the power spectral density, which is computed using a 1024-point FFT. The feature set is collected
by summing up the values over equal length segments of the power spectrum. For the acoustic and seismic
sensors the maximum frequencies used were 1000 Hz and 200 Hz, respectively.


18.3.2.1.2 Energy based detector


An energy based detector is also used for the verification of the improvement in decision accuracy when
the value of information based fusion architecture is used. This detector was developed by BAE, Austin
[3], also as part of the SensIT program. A brief description of this detector is provided below.


For every block of a given signal, the energy of the downsampled version of the power spectral density
is computed. For the computation of the power spectral density, a 1024-point FFT is used. This energy is
compared with a threshold value. Whenever the energy is above the threshold, it is declared that the
target is detected. The threshold value is adaptively changed based on the background energy.


18.3.2.2 Experimental details 


The classifier and detector described above, the measures of value of information, and the fusion
algorithm which uses these measures to decide when and when not to fuse information were implemented and
tested using real data. The data was obtained by distributing sensor nodes along the east-west and
south-north roads at Twentynine Palms, CA during one of the field tests (SITEX'02), as shown in Figure
18.7. These sensor nodes are manufactured by Sensoria. On each sensor node, three sensors (acoustic,
seismic and IR), a four channel data acquisition board and a processing board are available. These
nodes also have communication capabilities. For more details on the sensor node, refer to [14].


Figure 18.7: Sensor node distribution at Twentynine Palms, CA


Three vehicles, AAV, Dragon Wagon (DW) and HMMWV, were driven along the east-west and north-south
roads as shown in Figure 18.7 while conducting the experiments. The node placements are also shown in
this figure. In total, twenty four nodes were considered in our experiments. We used both seismic and
acoustic data from these nodes where appropriate. In the next section, the classification experimental
details and results are provided, and in section 18.3.2.2.2 the detection experiments and results are
provided. In both these sections, experimental details and results are provided with and without the
value of information based fusion technique that was developed in this study.


18.3.2.2.1 Classification experiments


First, acoustic data from each node is considered. The maximum likelihood classifier is trained using
only acoustic data from individual nodes. The challenges in the classification experiments are
threefold: 1) when to reject a source of data, 2) when to propagate data between sequential nodes, and
3) when to share individual sensor data within the same node. Using only acoustic data, we investigated
the effectiveness of the four measures of value of information outlined in Section 18.2.4: mutual
information, Euclidean distance, correlation, and Kullback-Leibler distance.


In addition, we investigated two methods of using these measures. When evaluating the effectiveness
of fusing two sources of data, is it better to compare the two sources with each other or with the
stored training data? To answer this question, we devised several similarity measures to measure the
closeness of two data sources. We calculated these measures between data at all sequential nodes. Then
for each similarity measure, we computed its correlation with correct classification performance at each
node. We call this the performance correlation. The average performance correlation over all nodes for
each class of data using previous node similarity measures is shown in Figure 18.8. Next, we calculated
the same similarity measures between the data at each node and the data stored in the training sets.
Again, for each similarity measure, we computed its correlation with correct classification performance
at each node. The average performance correlation over all nodes for each class of data using training
set similarity measures is shown in Figure 18.9.


Inspection of Figures 18.8 and 18.9 shows that the similarity measures Euclidean distance and
correlation are more closely aligned with correct classification performance than either mutual
information or Kullback-Leibler distance. In practice, however, we found that the Euclidean distance
outperformed correlation as the determining factor in fusion decisions. Furthermore, comparing Figures
18.8 and 18.9 shows that using the training set for similarity measures is more effective than using the
data from the previous node in the network. We found this to be true in practice as well. Subsequent
work with the seismic data echoed the findings for the acoustic data. Note that even though we use the
training data to make the fusion decision, we perform the actual data fusion with current and previous
node data.




Figure 18.8: Performance correlation of previous node data
(legend: distance, rho of means, mean of rhos, mut info, kullback)


Figure 18.9: Performance correlation of training class data




Rejection of bad data


Sometimes one node or one sensor can have bad data, in which case we prefer to reject this data rather
than classify with poor results. The feature discrimination algorithm is used for this. Rejected data
was not fused with any other data or passed on to any other node, and no classification was computed at
that source. Our method resulted in the rejection of several sources of bad data, thus improving the
overall classification results as shown in Figures 18.10 and 18.11.


Figure 18.11: Performance of node fusion for the DW with seismic sensor data


Node to node fusion


The fusion decision can be made with a threshold, i.e., if the distance between two feature sets is
below some value, then fuse the two feature sets. The threshold value can be predetermined off-line or
adaptive. We sidestep the threshold issue, however, by basing the fusion decision on relative distances.
To do so, we initially assume the current node belongs to the same class (i.e., the target class) as the
previous node and employ the following definitions. Let $x_n$ be the mean vector of the current node
data. Let $x_{nf}$ be the mean vector of the fused data at the current node. Let $x_{c1}$ be the mean
vector of the target training class data. Let $x_{c2}$, $x_{c3}$ be the mean vectors of the remaining
training classes. A Euclidean distance ratio is defined as:

$$r_{dist} = d_{c1} / \min(d_{c2}, d_{c3}) \qquad (18.13)$$

where $d_{ci}$ is the Euclidean distance (18.8) between $x_n$ and $x_{ci}$. We then use the following
pseudocode to make our fusion decisions.


if (r_dist <= 1.0)
    fuse_4class = 1; fuse_4carry = 1;
    class_ind = classify x_n;
    if (class_ind >= 70%) check class_fuse; end
else
    fuse_4class = 0; fuse_4carry = 0;
    if ((d_c1 <= 3*sigma_c1) & (d_c2 <= 3*sigma_c2) & (d_c3 <= 3*sigma_c3))
        class_ind = classify x_n;
        if (class_ind = target class) fuse_4class = 1; end
        if (class_ind >= 70%)
            fuse_4carry = 1;
            class_fuse = classify x_nf;
            if (class_ind > class_fuse)
                class_fuse = class_ind;
            end
        end
    else
        reject this data;
    end
end


There are two outcomes to the fusion decision. First, we decide whether or not to fuse the data at the
current node. If the current node has bad data, fusion can pull up the performance; however, we may not
want to carry the bad data forward to the next node (the second fusion decision outcome). fuse_4class
is a flag indicating whether or not to fuse for the current classification. fuse_4carry is a flag
indicating whether or not to include data from the current node in the fused data that is carried
forward. Based on this decision, the fusion of classification decisions is achieved by applying the
fusion algorithm described in section 18.2.5. In Figures 18.10 and 18.11 we show the correct
classification improvement gained by fusing from node to node for the acoustic and seismic sensors,
respectively. For the acoustic sensor we show classification results from the AAV data, while using DW
data for the seismic sensor results. In the case of the acoustic data, the mean correct classification
performance across all nodes increases from 70% for independent operation to 93% with node to node
fusion across the network. Similarly, the seismic correct classification performance increases from 42%
to 52%.
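The relative-distance test at the start of the fusion decision can be sketched as follows. Only the distance ratio (18.13) and the initial setting of the two flags are shown; the classifier calls, the 70% confidence checks and the rejection branch of the pseudocode are omitted, and the function names are hypothetical.

```python
import math


def euclid(a, b):
    """Euclidean distance (18.8) between two mean feature vectors."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def distance_ratio(x_n, target_mean, other_means):
    """Ratio (18.13): distance to the target class over distance to the nearest other class."""
    d_target = euclid(x_n, target_mean)
    d_other = min(euclid(x_n, m) for m in other_means)
    if d_other == 0.0:
        return math.inf  # current data coincides with a non-target class mean
    return d_target / d_other


def initial_fusion_flags(x_n, target_mean, other_means):
    """Return (r_dist, fuse_4class, fuse_4carry) before any classifier-based refinement."""
    r_dist = distance_ratio(x_n, target_mean, other_means)
    flag = 1 if r_dist <= 1.0 else 0
    return r_dist, flag, flag
```

A ratio at or below 1 means the current node data is at least as close to the target class as to any other training class, so no absolute threshold needs to be tuned.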


Fusion between sensors


After fusion from node to node of the individual sensors, we look at the benefit of fusing the acoustic
and seismic sensor data at the same node. To do so, we employ the following definitions. Let $r_{dist}$
be defined as in (18.13) but with the new data types (a - acoustic, s - seismic, and as - a concatenated
acoustic/seismic vector). Let $x_a$ be the mean vector of the current node acoustic data after fusion
from node to node. Let $x_s$ be the mean vector of the current node seismic data after fusion from node
to node. Let $x_{as}$ = $x_a$ concatenated with $x_s$ (dumb fusion). Let $x_{asf}$ = smart fusion of
$x_a$ with $x_s$. Let $x_{in}$ be the data input to the classifier. Now, we employ two steps in the
sensor fusion process, as shown in the pseudocode below. In this case also, for the fusion of features
from two independent sources such as acoustic and seismic, the DSm based technique described in section
18.2.5 is applied. First we employ a smart sensor fusion routine:


indx = index of min(r_a_dist, r_s_dist, r_as_dist)
if (indx = 1) x_in = x_a;
elseif (indx = 2) x_in = x_s;
elseif (indx = 3) x_in = x_as;
end


Next, we employ a final fusion routine: 


class_acst = classify x_a;
class_seis = classify x_s;
class_as_dumb = classify x_as;
class_as_smart = classify x_asf;
if ((class_acst >= 70%) | (class_seis >= 70%) | (class_as_ind >= 70%))
    class_final_fuse = max(class_acst, class_seis, class_as_dumb, class_as_smart)
end


Figure 18.12 shows the results of fusion at each stage in the form of a bar plot. The classification
performance is averaged over all the nodes for each vehicle class. The correct classification
performance improves at each stage of fusion processing, as shown in Table 18.1. The results indicate
that fusion based on value of information helps significantly in improving the decision accuracy at
each node.


Table 18.1: Summary of classification performance (rows: acoustic & seismic, independent; acoustic & seismic with fusion)

Figure 18.12: Average correct classification performance at each step in the fusion process


18.3.2.2.2 Detection experiments Both acoustic and seismic data were also considered for the
detection experiments. First, only acoustic data from individual nodes were used. A threshold value was
initially set and then varied adaptively based on the background energy. The power spectral density
of the acoustic data was computed using a 1024-point FFT and downsampled by 8. The energy of the
downsampled power spectral density was computed and compared with the threshold value. If the
energy was above the threshold, the target was declared detected. The time of detection and the
confidence in the detection were also calculated. The detection and the time of detection were compared
with the ground truth. If the target was detected when it was supposed to be, and the time of detection
was within the region of interest, the event was counted towards the probability of detection. If the
detection time was outside the region of interest (missed detection), or if a target was detected when
it should not have been (false alarm), the event was counted towards the probability of false alarm.
Using only acoustic data from individual nodes without any fusion, the probabilities of detection for
AAV, DW and HMMWV are 0.8824, 0.8677 and 0.8382, and the corresponding probabilities of false alarm
are 0.1176, 0.1323 and 0.1618, respectively. Similarly, using only seismic data from individual nodes
without any fusion, the probabilities of detection for AAV, DW and HMMWV are 0.8030, 0.7910 and 0.5735,
and the corresponding probabilities of false alarm are 0.1970, 0.2090 and 0.4265, respectively.
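
The energy detector described above can be sketched as follows. This is only an illustration under stated assumptions: downsampling is done by averaging blocks of 8 spectral bins, the adaptive threshold is a multiple `k` of a running background-energy estimate with smoothing factor `alpha`, and the names `psd_energy` and `detect` are hypothetical. The text does not give the actual threshold rule or downsampling method.

```python
import numpy as np

def psd_energy(frame, nfft=1024, decim=8):
    """Energy of the power spectral density after downsampling by `decim`."""
    spectrum = np.fft.rfft(frame, n=nfft)        # nfft // 2 + 1 bins
    psd = np.abs(spectrum) ** 2 / nfft           # periodogram estimate
    usable = (len(psd) // decim) * decim         # drop the leftover bin(s)
    # Assumption: downsample by averaging each block of `decim` bins.
    psd_ds = psd[:usable].reshape(-1, decim).mean(axis=1)
    return float(psd_ds.sum())

def detect(frames, k=3.0, alpha=0.9):
    """Flag frames whose PSD energy exceeds k times the background energy."""
    background = psd_energy(frames[0])           # initial background estimate
    decisions = []
    for frame in frames:
        energy = psd_energy(frame)
        hit = energy > k * background
        decisions.append(hit)
        if not hit:                              # adapt only on background frames
            background = alpha * background + (1 - alpha) * energy
    return decisions

# Synthetic check: low-level noise frames with one frame containing a tone.
rng = np.random.default_rng(0)
n = np.arange(1024)
frames = [0.1 * rng.standard_normal(1024) for _ in range(7)]
frames[5] = frames[5] + 2.0 * np.sin(2 * np.pi * 64 * n / 1024)
print(detect(frames))
```

Comparing such decisions and their detection times with ground truth would then give the probabilities of detection and false alarm counted as described in the text.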
Next, the mutual-information-based value of information measure was applied to the energy of the power
spectral density to decide whether to fuse data between the acoustic and seismic sensors on each
individual node. The detector was tested using the fused data on each node. The probabilities of detection
and false alarm were computed as described above. The probability of detection for this intelligently fused
data for AAV, DW and HMMWV is 0.9394, 0.9105 and 0.8529, respectively. The probability of false
alarm is not listed because it equals 1 - probability of detection, since false alarms and
missed detections are counted together. These results are summarized in Figure 18.13 in the form of a
bar graph. They show that intelligent sensor data fusion based on value of information
and DSmT improves the detection accuracy for all three vehicle classes. This type of fusion especially
helps with difficult data, as in the case of HMMWV.
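
The reported figures can be cross-checked with a few lines of Python. The probabilities are the ones quoted in the text; the function name `summarize` and the dictionary layout are assumptions made for this illustration.

```python
# Probabilities of detection quoted in the text, per vehicle class.
pd_acoustic = {"AAV": 0.8824, "DW": 0.8677, "HMMWV": 0.8382}
pd_seismic = {"AAV": 0.8030, "DW": 0.7910, "HMMWV": 0.5735}
pd_fused = {"AAV": 0.9394, "DW": 0.9105, "HMMWV": 0.8529}

def summarize():
    """Derive false-alarm probability and gains of fusion for each class."""
    out = {}
    for vehicle, pd in pd_fused.items():
        out[vehicle] = {
            "pfa": 1 - pd,  # false alarms and misses are counted together
            "gain_vs_acoustic": pd - pd_acoustic[vehicle],
            "gain_vs_seismic": pd - pd_seismic[vehicle],
        }
    return out

for vehicle, row in summarize().items():
    print(vehicle, {name: round(value, 4) for name, value in row.items()})
```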
Figure 18.13: Performance of a detector

18.4 Conclusions


In this chapter, we have described how the minmax entropy principle can be used to discover features
(missing information) and the type of sensor (information type) from which this missing information can
be obtained. Further, a consistency measure is defined, and it has been shown that this measure can
be used to discriminate between sensors or assess their quality. Next, conditional entropy and mutual
information measures are defined, and it has been shown that these two measures can be used to make
sure that the estimated missing information or new feature set indeed helps in gaining information and
aids the decision process. Furthermore, we have introduced several measures for value of information and
have used them to decide when to fuse information. For the fusion we have developed an
algorithm using DSmT. We have demonstrated all the measures and the fusion, first on a
simulated network of radar sensors and then on a real network of spatially distributed sensor
nodes with multiple sensors on each node. The experimental results indicate that (a) the
minmax entropy principle can be used to estimate the missing information and information type, and
it can be used in cluster formation; (b) the consistency measure based on within-class entropy can be
used in sensor discrimination; (c) the mutual information can be used in feature quality assessment and
in evaluating the value of information; (d) the measures of value of information help in smart fusion;
(e) the distributed smart fusion significantly improves the decision accuracy. All these measures help
in probing (awakening) the required sensor for the required missing information, transmitting
valuable information only when and where it is needed, and fusing only valuable information. Thus, power,
computing, and communication resources can be efficiently utilized.